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HEPATITIS-C VIRUS TESTING 

Technical Field 

The present invention relates to the discovery of new 
types of hepatitis C virus, that we have termed type 3 
(HCV-3) and type 4 (HCV-4) . in particular, it relates to 
the etiologic agent of hepatitis C virus type 3 and 4, and 
to polynucleotides and imnunoreactive polypeptides which 
are useful in immunoassays for the detection of HCV-3 and 
HCV-4 in biological samples; and also to the use of 
antigenic HCV-3 and HCV-4 specific polypeptides in 
vaccines • 

Background of the Invention 

Acute viral hepatitis is a disease which may result 
in chronic liver damage. It is clinically diagnosed by a 
well-defined set of patient symptoms, including jaundice, 
hepatic tenderness, and an increase in the sertim levels of 
alanine aminotransferase and aspartate aminotransferase. 
Serologic immunoassays are generally performed to diagnose 
the specific type of viral causative agent. 
Historically, patients presenting with symptoms of 
hepatitis and not otherwise infected by hepatitis A, 
hepatitis B, Epstein-'Barr or cytomegalovirus were 
clinically diagnosed as having non-A, non-B hepatitis 
(NANBH) by default. 
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For nanj years , the agent of non-A, non-B hepatitis 
rejnained elusive. It has now been established that many 
cases of NANBH are caused by a distinct virus termed 
hepatitis C virus (HCV) . European Patent Application 
EP-A-0318216 discloses cDNA sequences derived from HCV, 
polynucleotide probes and polypeptides for use in 
immunoassays. Further information is provided in 
European Application EP-*A»0388232. 

The HCV genome encodes a large polyprotein precursor, 
which contains structural and non-structiiral regions. The 
single protein is apparently cleaved into a variety of 
proteins after production* Most of the structural and 
non-*structu3ral proteins have now been identified from in 
vitro BNA translation and expression as recombinant 
proteins. The c and E regions encode for nucleocapsid 
structural proteins and for envelope structural proteins, 
respectively. At least five additional regions follow, 
which encode for non-structural (NS) protein of undefined 
function. The organisation is believed to be as follows 
(A* Albert!^ Journal of Hepatology, 1991; 12; 279 to 282) 

5' 3' 
NCR: C : El : E2 r NSl : NS2 : NS3 : NS4 : NS5 

Certain immunoreactive proteins have been described as 

recombinant proteins, for example C22 (in the core 

region) , C33 (in iTS3 region) , 5-1-1 and ClOO (both in the 

NS4 region) , and NS5 (NS5 region) . Diagnosis of 

hepatitis c is still largely based on methods which detect 

antibodies against the product of the C-100 clone. This 
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clone wa^ ligated with overlapping clones to produce a 
larger viral antigen (Cioo) corresponding to part of the 
NS3-NS4 genomic region, CIOO was then fused with the 
human superoxide dismutase (SOD) gene, expressed in use as 
a large recombinant fusion protein (Cioo-3) and used on 
solid phase to develop radio-labelled (RIA) and 
enzyme-linked immunosorbent assays (ELISA) . 

Polynucleotides useful for screening for HCV are 
disclosed in European Patent Specification EP-A-0398748* 
European Patent Specification EP-A-0414475 purports to 
disclose the propagation of HCV in culture cells and the 
production of antigens for use in diagnostics « European 
Patent Specification EP-A--0445423 discloses an improved 
immunoassay for detecting HCV antibodies. 

Blood banks in the united Kingdom have recently begun 
routine testing of blood donors for antibodies to 
components of HCV. One assay involves the detection of 
HCV antibodies to clDO-3 polypeptides- The ClOO-3 
antibody recognises a composite polyprotein antigen within 
non-^structural regions of the virus and is a consistent 
marker of HCV infection. However, in acute infections 
this antibody is unreliable because of the delay 
(typically 22 weeks) in seroconversion after exposure. 
Furthermore, the C100-'3 antibody test lacks specificity 
for the hepatitis C virus* 

Second generation antibody tests employ recombinant 
antigens or synthetic linear peptides representing 
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Structural antigens frpm the highly conserved core region 
of the virus as well as non-structural antigens. However, 
it is found that some second-generation ELISA tests can 
yield false-positive reactions. The recombinant 
iHummoblot assay CRlBA-2) incorporating four antigens from 
the HCV genome, provides a method for identifying 
genuine-anti-HCV reactivity. However, the result can be 
"indeterminate". The present workers have reported (The 
Lancet, 338; Oct. 19, 1991) varying reactivity of 
HC7-positive blood donors to 5-1-1, ClOO, C33C and C22 
antigens, and compared these with the results of the 
direct detection of HCV RNA present in the blood samples 
using polymerase chain reaction (PCR) to amplify HCV 
polynucleotides. However, the work demonstrates that the 
unambiguous diagnosis of HCV infections is not yet 
possible. 

Recently there has been discovered a second type of 
HCV (References 1, 2) called K2 that differs considerably 
in sequence from the published prototype (Reference 3) or 
the first type Kl sequences (References 4 and 5) . 



Summary of the Invention 



The present invention is based on the discovery of a 
previoxisly unknown type 3 and 4 variants of HCV, by a 
comparison of sequences amplified by PCR in certain 
regions of the HCV genome and confirmed by phylogenetic 
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analysis* The invention has thus identified 
polynucleotide sequences and polypeptides which are 
and HCV-4 specific* These may be used to diagnose HCV-3 
and HCV«4 infection and should thus be included in any 
definitive test for HCV infection. 

One aspect of the invention provides polynucleotide 
sequences unique to hepatitis C virus types 3 and 4 (HCV-3 
and HCV-4). The sequences may be RNA or DNA sequences, 
in principal any HCV-3 or HCV-4 specific polynucleotide 
sequence from non-coding, core. El, E2 or NS1--5 genome 
regions can be used as a hybridisation probe. The 
sequences may be recombinant (i.e. expressed in 
transformed cells) or synthetic and may be comprised 
within longer sequences if necessary. Equally, deletions, 
insertions or substitutions may also be tolerated if the 
polynucleotide may still function as a specific probe. 
Polynucleotide sequences such as core, NS3, NS4 and NS5 
which code for antigenic proteins are particularly useful. 

Another aspect provides an antigenic HCV-3 or HCV-4 
specific polypeptide, particularly from the core, NS3, NS4 
or NS5 regions (e.g. the HCV-3 or HCV-4 counterparts of 
ClOO polypeptide, 5-1-1 polypeptide, C33 polypeptide or 
C22 polypeptide or epitopes thereof) or polypeptides 
including these antigens. 

A further aspect of the invention provides labelled 
antigenic HCV-3 or HCV-4 specific polypeptide (or mixtures 
thereof, particularly from the core and NS4 regions) for 
use in an immunoassay. 
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A further aspect of tne invention provides antibodies 
to HCV'-a or HCV-4 specific antigens, particularly 
monoclonal antibodies for use in therapy and diagnosis. 
Thus labelled antibodies may be used for in vivo 
diagnosis. Antibodies carrying cytotoxic agents may be 
used to attack HCV-3 or HC7-4 infected cells • 

A further aspect of the invention provides a vaccine 
comprising immunogenic HCV--3 or HCV-4 specific polypeptide. 

The or HCV-4 specific polynucleotide sequences 

may be used for identification of the HCV virus itself 
(usually amplified by PCR) by hybridisation techniques. 

Oligonucleotides corresponding to variable regions in 
the NS-4 region could be used for type-specific PCR. 
outer sense and inner sense primers may be used in 
combination with the two conserved antl^^sense primers for 
a specific detection method for HCV types 1, 2,3 and 4. 

Immunoreactive HCV-3 or HCV-4 specific polypeptides 
(particularly from the core and NS4 regions) may be used 
to detect HCV-3 and HCV'-4 antibodies in biological 
samples, and may also provide the basis for immunogens for 
inclusion in vaccines (especially the El polypeptide) . 
The term "peptide" is used herein to include epitopic 
peptides having the minimum number of amino acid residues 
for antigenicity, through oligopeptides, up to proteins. 
The peptide may be a recombinant peptide expressed from a 
transformed cell, or could be a synthetic peptide produced 
by chemical synthesis. 
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In particular, the invention allows blood donor 
screening by conventional assays (using HCV type i encoded 
antigens) to be supplemented with a second test that 
contains two oligopeptides corresponding to first and 
second antigenic regions found in the NS-4 sequence of HCV 
type 3 (positions 1691 to 1708; sequence 
KPALVPDKEVLYQQYDEM and positions 1710 to 1728; sequence 
ICSQAAPYIEQAQVIAHQF) and two derived from the equivalent 
regions of HCV type 2, R(A/V)V(V/I) (A/T)PDKE(I/V)LyEAFDEM 
and ECAS(iyR)AALIEEGQR(M/I)AEML. 

The corresponding HGV-4 antigens from substantially 
positions 1691 to 1708 and 1710 to 1728 may be used for 
HCV--4 detection. 

Thus, the present invention has also identified 
corresponding polynucleotide and polypeptide sequences 
which may be used to identify hepatitis C type 2 viral 
infection. 

Production and detection of the antigen-^antibody 
immune complex may be carried out by any methods currently 
known in the art. For example, a labelling system such 
as enzyme, radioisotope, fluorescent, luminescent or 
chemiluminescent labels may be employed, usually attached 
to the antigen. Labelled anti-antibody systems may also 
be used. The recombinant: antigen may be either used in 
liquid phase or absorbed onto a solid substrate. 

Oligopeptides corresponding to the antigenic regions 
of all three major types may also be used separately to 
serologically distinguish individuals infected with 
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different HCV types. Such an assay could be in the format 
of an indirect enzyme immunoassay (EIA) that used sets of 
wells or beads coated with oligopeptides of the two major 
antigenic regions for KCV types 4, 3 and 2, and with type 
1 (KPA(V/I)IPDREVLYIIEFDEM and RPAV(X/V) PDREVLYQEPDEM and 
ECSQH£PYIEG(M/A)AEQF) • Minor degrees of cross-reactivity, 
should they exists can be absorbed out by dilution of the 
test serum in a diluent that contained blocking amounts of 
soluble heterologous-type oligopeptides, to ensure that 
only antibody with type-^specif ic emtibody reactivity bound 
to the solid phase, 

immunogens for use in vaccine formulations may be 
formulated according to techniques currently known in the 
art, including the use of suitable adjuvant and 
immune-stimulation systems* 

Finally, the present invention also encompasses assay 
kits including polypeptides which contain at least one 
epitope of HCV-3 or antigen Cor antibodies thereto), 

as well as necessary preparative reagents, washing 
reagents, detection reagents and signal producing reagents. 

Description of the Drawings 

Examples of the invention will now be described by 
way of example only. 

Figures 1 and la give cDNA sequences obtained from 
PCR amplification of a region -255 to -62 of the 5' 
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non-coding region of HCV samples from 18 blood donors and 
a comparison with previously published nucleotide 
sequences <see Table 2) ; sequence numbering corresponding 
to the prototype HCV-1 sequence (ref.4) and previous 
designations of type 1 or 2 being indicated; 

Figure 2 is a phylogenetic analysis showing 
clustering of the sequences into three types viz; HCV-1, 
HCV-2 and HCV-3 for the S' NCR results of Figure 1 using 
the maximum likelihood algorithm, shown as an unrooted 
tree. Numbers 1*18 in full circles correspond to blood 
donor sequences E-bl through E-bl8. Niamber^ 1 to 26 in 
open circles correspond to the previously published 
sequences identified in Table 2 . ; 

Figure 3 is a comparison of deduced amino acid 
sequences in the NS-5 region of blood donors (E-bl, E-b2, 
E-b3, E-b7 (type 3) and E-bl2 (type 2) with those 
previously published (Table 2). Amino acid residue 
numbering follows that of the HCV-1 polyprotein (4) and 
uses single letter amino acid codes; 

Figure 4 is a phylogenetic analysis of the NS-5 
region using the maximum likelihood algorithm, shown as an 
iinrooted tree- symbols are as described for Figure 2; 

Figure 5 is a comparison of deduced amino acid 
sequences in the NS-3 region of blood donors (E-bl, E^b2, 
E-h6, E-b7 (type 3) with those previously published 
(Table 2). Group 1/1: amino acid sequence of fl^ f3, f4, 
f5, h2/h3, h4 (one), i2 , 13, i4, pi, p2 ; Group 1/2: 
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amino acid sequence of iS; Group 1/3: amino acid sequence 
of h2, h3, h4 (one), hS, f2, p3, il; croup X/4: amino 
acid sequence of hi (one); Group 1/5; amino acid sequence 
of hi (one) - Numbering, symbols and abbreviations are as 
described for Figure 3; 

Figure 6 is a phylogenetic analysis of the NS-3 
region using the maximum likelihood algorithm, shown as an 
unrooted tree. Representative nucleotide sequences of the 
5 groups of type 1 sequences shown in Figure 5 coded as 
follows: 19 (full circle) 13; 20 (full circle) i4; 21 
(full circle) hS; 22 (full circle) h3; 23 (full circle) 
hi. symbols are as described for Figure 2; 

Figure 7 is a con^arison of deduced amino acid 
sequences in the core region of blood donor E-bl (type 3) 
with those previously published (Table 2) . Numbering, 
symbols and abbreviations are as described for Figure 3; 

Figure 8 is a phylogenetic analysis of the core 
region using the maximum likelihood algorithm, shown as an 
unrooted tree. Symbols are as described for Figure 2. 

Figure 9(a) shows nucleotide, and Figure 9(b) shows 
deduced amino acid sequences of HCV type 3 variants 
amplified from 5 Scottish blood donors (nos. 40, 38, 36, 
26 and 1787) in the putative NS-4 region of HCV 
(nucleotides and amino acid residues numbered as in Choo 
et al., (1991). Nucleotide codes: G: guanidine; C: 
cytidine; A: adenine? U: uridine; amino acid codes: A: 
alanine; R: arginine; N: asparagine; D: aspartic acid; C: 
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cysteine; Q: glutamine; E: glutamic acid; G: glycine; H: 
histidine; I: isoleucine; L: leucine; K: lysine? M: 
methionine; F: phenylalanine; P: proline; S: serine; T: 
threonine; w: trytophan; V: tyrosine; V: valine.".": 
sequence not determined; difference from consensus shown 
in bold. 

Figure 10(a) shows a comparison of amino acid 
sequences between residues 1679 and 1768 (Choo et al*, 
1991) of the three major variants of HCV, T16, T42, T77, 
TlBOl, T1825: Scottish blood donors infected with HCV 
type 1; T351: Scottish blood donor infected with HCV type 
2; T59, T940^ T810: Scottish blood donors infected with 
HCV type 2; T40, T38, T36, T26, T1787: Scottish blood 
donors infected with HCV type 3; and Figure 10(b) shows 
the derivation of consensus sequences for HCV types 3, 2 
and 1 oligopeptide series. Differences from consensus 
shown in bold. Amino acid codes: A: alanine; R: 
arginine; N: asparagine? D: aspartic acid; C; cysteine; 
Q: glutamine; Et glutamic acid; G: glycine; H: histidine; 
I: isoleucine; L: leucine; K: lysine; M: methionine; F: 
phenylalanine; P: proline; S: serine; T: threonine; W: 
tryptophan; Y: tyrosine; V: valine; ".": not determined. 

Figures 11(a) to 11(c) show amino acid sequences of 
nonameric oligopeptides used for epitope mapping, derived 
from consensus HCV type 3, type 2 and type l sequences 
respectively. Amino acid codes: A: alanine; R: arginine; 
N: asparagine; D: aspartic acid; C: cysteine; Q: 
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glutamine; E: glutamic acid; G: glycine; H: histidine; I: 
isoleucine; L: leucine; K: lysine; M: metSiionine; F: 
phenylalanine; P: proline; St serine; T: threonine; W: 
tryptophan; Y: tyrosine; V: valine; 

Figures I2a^ 12b and 12c show antibody reactivity of 
three sera from blood donors infected with HCV type 3 with 
HCV type 3-encoded oligopeptides in the antigenic region 
of NS-4 (seguences 1-82 shown in Figure iia) . Antibody 
reactivity to oligopeptides (x-axis) , recorded as optical 
densities in the range from -01 to 0-75 (and >0.75) 
recorded on the y-axis. 

Figure 13 is a comparison of divergent HCV sequences 
with representative type 1, 2 and 3 sequences in variable 
regions of the 5''1ICR* Sequences from -25S to -246^ -21S to 
-186, -115 to -102 and -69 to -62 identical to prototype 
sequence. sequence identity with HCV-1; gap 

introduced in sequences to preserve alignment; 
sequence not determined* Origins of sequences : Eg- 1-33 
: Egypt; NL-'26 : Holland; HK- 1«4 : Hong Kong; IQ-48: 
Iraq; XX-96 : xxxxx* Figures in parentheses number each 
non-identical sequence. 

Figure 14 is a phylogenetic analysis of the 5 'NCR 
region using the maximum likelihood algorithm, shown as an 
unrooted tree. Sequences X-17 in solid circles are 
numbered as in Figure 13; previously published sequences 
numbered as in table 1 of (992) , Scottish blood donor 
sequences Eb-1-Eb*-12 numbered 51-62 in hollow circles • 
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For clarity, only non- identical sequences are shown in 
tree; e.g. Sequence 1 corresponds to those found in 
samples Eg-16 and Eg-29 etc. (Figure 1) . Hollow squares 
are published sequences from Zaire; Hollow small circles 
are sequences from South Africa; Hollow small solid 
circles are sequences obtained elsewhere in the world. 

Figure 15 A/B is a comparison of nucleotide (A) and 
amino acid (B) sequences in the core region- Symbols as 
for Figure 13. Single letter amino acid codes are used; 

Figure 16 is a phylogenetic analysis of part of the 
core region using the maximum likelihood algorithm, shown 
as an unrooted tree, sequences are numbered as in Figure 
14; sequence 30 is that of HC-J8 (Okamato et al. Virology 
188:331 -341) ; and 

Figure 17 shows cleavage patterns for A) Haelll/Rsal 
and B) ScrFI in 5 'NCR. 

I) ANALYSIS OF HEPATITIS C VIRUS AND 
PHYLOGENETIC RELATIONSHIP OF TYPES 1, 2 and 3 
Introducrbion 

Sequence analysis of the 5' non coding region of 
hapatitis C virus (HCV) amplified from the plasma of 
individuals infected in Britain revealed the existence of 
three distinct groups of HCV, differing by 9-14% in 
nucleotide sequence. Two of the groups identified were 
similar to those of HCV variants previously termed type 1 
and type 2, while the third group appeared to represent a 
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novel virus type. Sequence comparisons were then made 
between the three virus types in other regions of the 
viral genome. In the NS-5 region, a high degree of 
nucleotide and amino acid sequence diversity was observed, 
with samples classified here as type "3" again forming a 
distinct group that was phylogenetically distinct from 
type 1 and type 2 variants. Type 3 sequences were 
similarly differentiated in the NS-3 and core regions from 
HCV type 1 sequences. The designation of virus types, 
including an observed sub-division of type 1 sequences 
into geographically distinct variants is discussed in 
relation to the new sequence data obtained in this study. 

Discussion 

Replication of nucleotide sequences by polymerase 
chain reaction (PGR) is a recently established tecamique. 
synthetic complementary primer sequences are hybridised to 
single-stranded DNA on either side of a genome region to 
be copied. The second strand is built up under the action 
of a heat-stable polymerase in the region between the 
primers. Heating then dissociates the two-strands and the 
replication process starts again. The PGR technique 
allows tiny amounts of polynucleotide to the amplified 
provided that there is sufficient sequence information to 
synthesise the primar sequences. 

The major problem associated with the use of the PGR 
to assess sequence variation using the PGR is the 
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possibility that mismatches between the primers and the 
variant sequence will prevent amplification. We have used 
several strategies to overcome this problem. For initial 
virus detection, we used primers in the 5 'NCR, which are 
reported to highly conserved amongst type 1 variants (4, 
11, 13, 16, 23, 24, 26, 33), and between Kl and K2 (23). 
Sequence analysis of the blood donors allowed the 
identification of type 1 and type 2 variants by comparison 
with published sequence data. This analysis also revealed 
the existence of a third "type" of HCV that appeared to be 
as distinct from type 1 as type 2 was (Figs, 1,2; Table 3); 
Based on our initial tentative classification, we sought 
corroboration of our findings in other (coding) and more 
variable regions of the viral genome* 

Analysis of the NS-5 region, which was based on 
several sequences of each of the three types (Figs, 3, 4; 
Table 3), confirmed the existence of 3 major groups, with 
type 3 sequences forming a relatively homogeneous group 
that was quite distinct from types 1 and 2. The proposed 
separation of type 1 sequences into FT and Kl "sub- types" 
and type 2 sequences into K2a and K2b is supported by this 
analysis, in which the single type 2 blood donor sequence 
obtained in this study appears most similar to K2b. 
Differentiation of HCV type 1 sequences into two groups is 
also clearly shown in the core (Fig- 7) and NS-3 regions 
(Fig. 5), in both cases with the type 3 sequences appearing 
considerably more distant* 
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The clxistering of phylogenetically distinct groups, 
tfeeir mixed distributions in a single geographic area (l, 
7, 23, 27, 35) and our own finding of dual or triple 
infections in individual haemophiliacs all strongly 
suggest that the three types described here are distinct 
viruses rather than simply representing geographical or 
epidemiologically clustered variants of a single, highly 
variable but monophyletic group. 

our own phylogenetic analysis of the 5 'NCR reveals 
the existence of three distinct groups. This contrastjs 
with analyses of coding region, where there appears to be 
a very prominent differentiation of type i sequences into 
two •* subtypes"* However, unlike type 2 2ind 3 variants, 
the two subtypes are geographically distinct, one sub-type 
comprising sequences obtained exclusively from Japanese 
patients, and the other comprising predominantly 
USA/European sequences (Table 2) * Indeed the only 
exception to this geographical classification is the HC--J1 
sequence (26) ; one apparent exception (Pt-l) was obtained 
from a Japanese haemophiliac treated with imported factor 
VIII of USA origin (7,23), which is likely to have 
contained HCV variants corresponding to the other 
sub-type • There is insufficient sequence data to indicate 
whether the two proposed type 2 subtypes, K2a and K2b 
(7,23) represent geographically distinct variants. 

The genomic organisation of HCV corresponds to that 
of flaviviruses and pestiviruses , with a single open 
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reading frame encoding a polyprotein that is stibsaquently 
cleaved into structural and non-structural proteins. Weak 
sequence homologies have been detected with several other 
virus groups that have positive^sense RNA genomes 
(19,21). Although the overall degree of sequence 
dissimilarity between types 1, 2 and 3 cannot be measured 
by comparison of the small regions of sequence analysed in 
this study, a rough estimate of the extent of divergence 
in protein coding regions is given by an examination of 
the divergence of the pairtial core sequence. This shows 
that the difference between HCV type 1 and type 3 core 
region (approximately 10% amino acid sequence divergence) 
is comparable to that which exists between different 
serotypes of the flavivirus, tick-borne encephalitis virus 
(14%; ref,20), but lower than that which is found between 
serotypes of a mosquito borne flavivirus, dengue fever 
virus (33%) , and the West Nile (WN) subgroup (28-43% 
divergence). The 5 'NCR sequences of the different members 
of WN subgroup are also considerably more diverse than 
those of the three types of HCV (=50% similarity; ref.5), 
although within each of the members e.g. Murray Valley 
encephalitis virus, the 5 'NCR is extremely well conserved 
(>95% similarity; ref.5). On the basis of these 
analogies, we speculate that the major types of HCV 
represent distinct "serotypes", each capable of human 
infection irrespective of the immune response mounted 
against other HCV types. 
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METHODS 



samples . Plasma from 18 different blood donors <E-bl 
through E-bl8) , that were repeatedly reactive on screening 
by Abbott 2nd generation enzyoeimmunoassay (EIA) , and 
confirmed or indeterminate by a recombinant immunoblot 
assay (RIBA; ortho; ref I) were the principal samples used 
in this study. Sequences in the NS-3 region from 5 
anti-HCT positive IVDUs (abbreviated as il - iS in ref. 
31), 5 haanophiliacs who had received non-heat treated 
Clotting concentrate, and who were also anti-HCV positive 
(hl-h5) , 3 pools of 1000 donations collected in 1983 
(pl-p3>. and 5 separate batches of commercially available 
non-heat treated factor VIII (fl - f5) correspond to those 
described previously (31) . 

Primers. The primers used for cDNA synthesis and 
polymerase chain reaction (PCR) are listed in Table 1. 
They were synthesised by Oswel DNA Service, Department of 
Chemistry, university of Edinburgh. 

«m E^rtraction and PCR. HCV virions in 0.2-1.0 ml volumes 
of plasma were pelleted from plasma by ultracentrifugation 
at 100,000g for 2 hours at RNA was extracted from 

the pellet as previously described (2,31) . First strand 
CDMA was synthesized from 3ul of RNA sample at 420C for 30 
min. with 7 units of avian myeloblastosis virus reverse 
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transcriptase (Promega) in 20ul buffer containing 50 mM 
TriS'-HCl (pH 8.0), SinM MgCl2, 5 mM dithiothreitol, 50 mM 
KCl, 0.05 ug/Ul BSA, 15% DMSO, 600 uM each of dATP, dCTP, 
dGTP and TTP, 1.5 uM primer and 10 U RNAsin (Promega). 

PGR was performed from 1 ul of the cDNA over 25 
cycles with each consisting of 25 sec. at 94<^c, 35 sec. at 
50^0 and 2.5 min. at 68^0. The extension time for the 
last cycle was increased to 9.5 min. The reactions were 
carried out with 0-4 unit Tag polymerase (Northximbria 
Biologicals Ltd.) in 20 ul buffer containing 10 mM 
Tris-HCl, pH 8.8, 50 mM KCl, 1.5 mM MgCl2, 0.1% Triton 
X-100, 33 uM each of dATP, dCTP, dGTP and dTTP and 0-5 uM 
of each of the outer nested primers. One ul of the 
reaction mixture was then transferred to a second tube 
containing the same medium but with the inner pair of 
nested primers, and a further 25 heat cycles were carried 
out with the same programme. The PCR products were 
electrophoresed in 3% low melting point agarose gel (IBI) 
and the fragments were detected by ethidium bromide 
staining and UV illumination- For sequence analysis, 
single molecules of cDNA were obtained at a suitable 
limiting dilution at which a Poisson distribution of 
positive and negative results was obtained (30). 

Direct Secmencincr of PGR Products . The PCR products were 
purified by glass-milk extraction ("GeneClean"; BiolOl, 
Inc.). One quarter of the purified products was used in 
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sequencing reactions with T7 DNA polymerase (Sequenase? 
United States Biologicals) perfonned according to the 
manufacturer's instructions except that the reactions were 
carried out in 10% DMSO and the template DNA was heat 
denatured before primer annealing. 

Phvlocrenetic Methods . The sequences were compiled by 
version 2.0 of the programs of Staden C32) and analysed by 
programs available in the University of Wisconsin Genetics 
Computer Group sequence analysis package^ version 7.0 
(6)^ Phylogenetic trees were inferred using two different 
programs available in the PHYLIP package of Felsenstein 
(version 3,4 June 1991; ref*9). The program DNAML finds 
the tree of the highest likelihood (the maximum likelihood 
tree) given a particular stochastic model of molecular 
evolution Eind has been shown to perform well in simulation 
studies (28) . In the analyses performed here the global 
(G) option was used as this searches a greater proportion 
of all possible trees. The second program used was 
NEIGHBOR which clusters (following the algorithm of Saitou 
& Nei; ref»29) a matrix of nucleotide distances previously 
estimated using the program DNADIST (which itself was set, 
using the D option, to use the same stochastic model as 
underlies DNAML in order to estimate distances corrected 
for the probabilities of multiple substitution) . In all 
cases the maximum likelihood and neighbour joining 
procedures produced congruent trees and thus only the 
former have been presented here* 
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To establish the interrelationships of the major 
types of HCV, we have separately analysed several regions 
of the viral genome that differ in sequence variability 
and evolutionary constraint. Thus the oonclusions drawn 
from the sequence comparisons are not subject to spurious 
evolutionary phenomena that may affect a particular 
region* However, one problem with the analysis presented 
here was the absence of a viral sequence that was 
sufficiently distantly related to HCV to serve as an 
out-group • Thus , although we describe the 
interrelationships of different sequence variants of HCV, 
it should be stressed that we have no means of deciding 
which sequence is ancestral to the others • The trees are 
thus drawn in the less familiar un-rooted form to indicate 
this. 

RESULTS 

1) Analysis of the 5^ non-coding region . Samples were 
obtained from 18 blood donors that were repeatedly 
reactive in the Abbott 2nd Generation enzyme immunoassay 
and which were confirmed or indeterminate in the Chiron 
4-IU:BA (E-bl through E-bl8, ref.lO). HCV sequences 
present in stored plasma samples from each donor were 
amplified with primers corresponding to sites in the 5 'NCR 
(12r25) that are well conserved between all known HCV type 
1 and type 2 variants (4,11,13,16,23,24,26,33). 
Sequencing of the PCR product, after limiting dilution to 
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isolate sijigle moleoules of cDMA before amplification, 
suLlowed approximately 190 bps in the centre of the region 
to be compared with equivalent published sequences (fig.l). 

Within the sequences/ constant as well as variable 
regions can be found. Six sequences from donors E-*bl3 
through E-bl8 closely resembled those previously described 
as type X (4,ll,13rl^#23,24,2e,33) and others resembled 
type 2(23) sequences (E-b9 through E-bl2). However, eight 
sequences (E^bl through E-b8) were distinct from both 
types, and have been provisionally termed type 3, 
Division of the sequences into three types is supported by 
formal phylogenetic analysis using the maximum likelihood 
(fig. 2) and neighbour joining algorithms (data not shown) 
of the blood donor sequences along with previously 
published sequences (identified in table 2} . Sequence 
variability within the three groups is in each case 
considerably less than that which separates the types. No 
sequence intermediate between the three types were found* 
This tree shows that the provisionally identified type 3 
group is equally distinct from type i as is type 2, Using 
the ONAML model, the corrected distances between sequences 
within each type were in each case less than 3%* Between 
groups, they ranged from 9% (between type 1 and type 3, 
and between type 1 and 2) , to 14% between type 2 and type 
3 (table 3)* 
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2) Analysis of the NS-5 Region . The nucleotide sequence 
of the NS-5 region has been found to vary significantly 
between the previously described Kl and K2 variants of HCV 
(7), To investigate whether type 3 sequences were equally 
distant from the other two types in this region as well as 
in the 5 'NCR, we compared sequences from four type 3 blood 
donors (E-blr E-b2r E-b3 and E-*b7) and one type 2 donor 
(E-bl2) with previously published sequences (Fig. 3; 
Pig. 4; table 3) • 

A remarkable variation was observed between sequences 
of the three types in this region. Again, type 3 
sequences form a separate group from type 1 and type 2 in 
this region. However, unlike the 5 'NCR, there appear to 
be subdivisions within the type i and type 2 groups. Type 
1 sequences are split between those found in Japanese 
infected individuals (e.g. HCV-J; HCV-BK; sequence numbers 
12, 13, 16-20 in table 2) and those of U.S.A. origin 
(HGV*1, Pt-1, H77, H90; sequence numbers 1-4? fig. 4). 
There is also some evidence for a split between type 2 
sequences, those corresponding to their previous 
designation as K2a (7) appearing distinct from type K2b 
sequences and the Scottish blood donor, E-bl2- 

Table 3 shows that the average nucleotide distances 
between the two groups of HCV type 1 sequences is 25% 
(indicated here as type la [USA] and type lb [Japanese]), 
with variation of only 4--7% within each group- The 
nucleotide sequence divergence within the two type 1 
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groups is similar to that which exists between K2a and K2b 
(table 3). However, both of these distances are 
considerably less than those which exist between type l 
and type 2 sequences (52-62%) , and type 3 (48*49%) , and 
the distance between type 2 and type 3 sequences (53-60%) • 

3) Analysis of the region - Ainplification reactions 

were carried out using previously published primer 
sequences in the NS-3 region (37>, and a pair of 
empirically derived inner primers (31) • Although these 
primers amplified HCV sequences from a high proportion of 
anti-c--iQO positive sera from haemophiliacs (31) , they 
were less effective with sera from IVDtJs (31) , and with 
blood donor samples (3 positive out of 15 tested; data not 
shown) . Two conserved sites in the amplified fragment 
were identified by sequence analysis of the NS-3 region 
from the haemophiliac and IVDU patients r and two new 
primers corresponding to these were specified (207, 208; 
Table 1) . The combination of 288-208 (first round) and 
290-207 (second round) primers successfully amplified 
samples from four donors infected with HCV type 3 (E-bl, 
E-b2jr E-b6 and E-b7) but none of those infected with HCV 
type 2 (data not shown) . This enabled a comparison of the 
new type with our own (31) and previously published type 1 
sequences (figs. 5, 6; table 3) . For clarity, only seven 
of the type 1 sequences obtained in this study (E«bl6, 
E-bl7, i3, i3, h5, h3 and hi) are shown in the tree. 
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These sequences are representative of the range of 
variation found in this region in individuals infected in 
Britain; coxaparison of the tree previously published (31) 
with fig .6 shows that the former forms a very small 
component of the overall tree obtained once Japanese type 
1 and type 3 sequences are added ^ 

The maximum likelihood tree shows that type l and 
type 3 have diverged considerably from each other. As was 
found in the NS-5 region, subtypes of type 1 sequences are 
found in NS-3. Again, sequences of Japanese origin 
(HCV-J, HCV-BK and JH) are distinct from the prototype 
(FT) sequence, and those found in Scottish blood donors 
(E-bl6, E-bl7, pl^3), IVDUs (il-5) and haemophiliacs 
(hl-5) , all of which correspond to the prototype sequence 
(fig»5)* However, the average subtype difference (23%) is 
lower than those that exist between HCV-l and HCV-J with 
the four type 3 sequences (37-43%) • As reported 
previously (31), the majority of nucleotide substitutions 
that exist between type 1 sequences are silent (i.e. do not 
affect the encoded amino acid sequence) , while numerous 
amino acid substitutions exist between type 1 and type 3 
sequences (fig.5). The analysis of the NS-3 region 
includes the sequence of clone A (35) which was obtained 
from Japanese patients with NANB hepatitis, and which was 
reported to be distinct from existing HCV type 1 
sequences. In Fig. 6, this sequence appears to be 
distinct from both HCV type l and type 3, with corrected 
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sequence distances of 33*43% and 36% respectively. 
Although it is not possible to assign this sequence to any 
known group at this stage, these distances are not 
inconsistent with the hypothesis that it represents a type 
2 sequence r or an equally distinct novel HCV type. 

4) Partial Secmence of the Putative Core Region of HCV , 

The region encoding the putative core protein is 
comparatively well conserved in its nucleotide sequence 
between Jcnown type i variants, showing nucleotide and 
amino acid sequence similarities of 90-98% and 98-99% 
respectively (11,. 24) . Part of the core region from the 
blood donor Ebl, who has type 3 sequences in other regions 
analysed was amplified with primers 410 and 406 and 
compared with previously published type 1 sequences 
(fig. 7, 8; table 3)- This analysis confirms that the type 
3 sequence was distinct from those of type 1, and. again 

there was a prominent subdivision of type 1 sequences into 

Japanese (HCV- J, HCV-BKr HC-J4^ JH and J7) and 

USA/European (HCV-l, H77, H90, GMl^ 6M2) sequences. As 

was found in MS*3^ very little amino acid sequence 

variation is found in the core regions of type 1 

sequences? almost all of the nucleotide differences 

between the two groups are at "silent" sites. By 

contrast, the type 3 sequence shows 7^8 amino acid 

substitutions on comparison with type 1 sequences. 
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TABLE I 



SEQUENCES AND SOURCES OF PRIMERS USED FOR AMPUHCATION OF HCV GENOME. 



410 
>406 



CORE 
CORE 



Name 


Region 


Position 
of S'taasc-" 


Sense" 


209 


5'NCR 


8 




211 


5'NCR 


-29 




939 


5*NCR 


-297 




940 


5'NCR 


-279 


+ 



410 
-21 



ATGTACCCCATGAGCTCGGC 
AGGTCTCGTAGACCGTCCATCATGAGCAC 



Ret". 



ATACrCGAGGTGCACGGTCTACGAGACCT (12) 

CACrCTCGAGCACCCTATCAGGCAGT (12) 

CTGTGAGGAACTACTGTCTT (25) 

TTCACGCAGAAAGCGTCTAG (25) 



^8 


NS-3 


4951 




CCGGCATGCATGTCATGATGTAT 




NS-3 


4932 




GTATTTGGTGACTGCKJTGCGJG 


208 


NS-3 


4662 


+ 


TCTTQAATnTGGGAGGGCCSTCrr 


207 


NS-3 


4699 


+ 


CATATAGATGCCCACrrcCTATC 



007 
J20 



NSa 
NS-» 



HCV type 3 fl 

221 NS-4 

008 NS«4 



(31) 
(31) 



5293 
5278 



4858 
4878 



AACTaSAGTATCCCACTGATCAAGTICCACAT 
CACATGTGCmCCCCCAGAA 



+ GGACXTACGCCCCTICTATA 
+ TCGGTnKKJGCCTGTCCAAAATG 



HCV type 2 : 

281 NS-4 4858 

509 NS-4 4878 



GGTCCCACCCCTCTCCTGTA 
CCGCrrGGGTTCC(3TTACCAACG 



HCV type 1: 

253 NS-4 4858 

1% NS-4 4878 



GGGCCAACACCCCTGCTATA 
CAGACTGGGCGCCGTrcAGAATG 
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!Cable 


1 cont.. 




4 


342 


NS-5 


8304 




CKjCCHjAATTCCTGCrrCATAOCCTCCCjTGAA 


555 


NS-5 


8227 




CCACGACTAGATCATCTCCG 


243 


NS-5 


7904 


+ 


TGGCKlATCCCGTATGATACCCGCTGCrrrGA (7) 


554 


NS-5 


7935 




CTCAAOZGTCACTOAACAGOACAT 



■Pbsiiion of 5'base idanvc lo HCV genomic sequence in refc no. (4) 
"Oiieaiaiion of primer sequence (+: sense: -: and-sense) 



* * 

t Abbtevtattons: A: adenine, c: cyddine: G: guanidine, T: thymidine. 

f Separate sense pnmeis reqtdied to enable amplificaiion of each HCV type. 



4 
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TABLE 2 

SOURCE AND CITATION OF PREVIOUSLY PUBUSHED HCV SEQUENCES USED IN THIS STUDY 



Geographical 



No. 


Type 


Abbreviation 


Source 


Reference 


Ref. 


I 


1 


HCV- 1 


U.S.A. 


Chooetal.. 1991 


(4) 


2 


1 


Pt-1 


Japan 


Nakao et al.. 1991 


(23) 










Enomoto et al., 1990 


(7) 


3,4 


1 


H77. H90 


U.S.A. 


Ogaiaefa/.. 1991 


(24) 


5,6 


1 


GM-1. CM-2 


Gemiany 


^v&h&etaL, 1991 


(11) 


7 


1 


Jl 


Jupaa 


Han et al., 1991 


(13) 


8 


1 


Al 


Australia 


Han et al., 1991 


(13) 


9 


1 


SI 


S. Africa 


Han et al.. 1991 


(13) 


10 


1 


Tl 


Taiwan 


Han et al., 1991 


(13) 


11 


I 


U18/K4 


U.S.A/Italy 


Han et al.. 1991 


(J3) 


12 


1 


HCV-J 


Japan 


Kaio et al., 1990 


(16) 


13 


1 


HCV-BK 


Japan 


Takamizawa et al.. 1991 


(33) 


14.15 


1 


HCJl. •J4 


Japan 


Okamoto et aL 1990 


(26) 


16-20 


1 


KL Kl- 1-4 


Japan 


Enomoto et al.. 1990 


(7) 


21 


1 


JH 


Japan 


Kubo et al.. 1990 


(17) 


22 


1 


n 


Japan 


Takeuchi et al., 1990 


(34) 


23-26 


2 


K2a.K2a-l. 


Japan 


Nakao et al.. 1991 


(23) 






K:2b. K2b-l 




Enomoto et al.. 1990 


(7) 


27 


0 


Clone A 


Japan 


Tsukiyaxna-Kohara. 1991 


(35) 
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NUCLEOTIDE DISTANCES BETWEEN THE THREE HCV TYPES 
m FOUR REGIONS OF THE GENOME. 



^'number of sequences analysed 
Va: not applicable 
"liAi t not done 



REGION 


TYres (in 


la 


lb 


2a 


2b 3 

• 


5'Nt3t 


I (20) 


0.0163 


n/a" 








2(6) 


0-0869 


n/a 


0.0214 






3(8) 


0.0948 


n/a 


0.1331 


n/a 0.0123 


CORE 


la (6) 


0.0358 










lb (5) 


0.0855 


0.0227 








3 ri) 


0.1801 


0.1511 


n/d^ 


n/d 0.0000 


NS-3 


la (34) 


0.0699 










lb (3) 


0J2270 


0.0535 








3 (4) 


0J689 


0.4279 


n/d 


n/d 0.0460 


NS-5 


la (4) 


0.0743 










ibcn 


0.2477 


0.0372 








2a (2) 


0.6092 


0.6206 


0.0612 






2b (3) 


0.5214 


0.5732 


0.2252 


0.0655 




3(4) 


0.4754 


0.4890 


0J983 


0.5299 0.0322 
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II) • SEROLOGICAL REACTIVITY OF BLOOD DONORS INFECTED WITH 

THREE DIFFERENT TYPES OF HEPATITIS C VIRUS. 

HCV sequences were amplified in the 5 'non-coding 
region (5'NCR), core, NS-3 and NS-5 regions from blood 
donors, haemophiliacs and intravenous drug abusers. 

Blood donations that were repeatedly reactive on 
screening with Abbott 2nd generation enzyme immunoassay 
(EIA) and positive or indeterminate by Ortho recombinant 
immunoblot assay (RIBA) were amplified by primers in the 
5 'NCR (reference 10). The first fourteen PCR-positive 
blood donations (where PCR was used to amplify and thus 
detect HCV RNA present in the blood) were then typed by 
sequence analysis of the amplified region, and compared 
with their serological reactivity to a range of structural 
and non-structural peptides in two 1st generation EIAs 
(Ortho HCV ELISA? Abbott HCV EIA) and two RIBA assays 
(Ortho RIBA and Innogenetics LIA; Table 4) , The five 
donations containing HCV type X sequences were positive in 
both EIAs, reacted with all antigens in the Ortho RIBA 
assay, and were broadly reactive in the LIA* However, all 
but two of the sera from donors with type 2 and 3 
infections were completely negative on anti-ClOO EIA 
screening and failed to react with 5-1-1, ClOO (RIBA) and 
NS4 (LIA). 
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Furthermore, some carriers of HC7 type 3 variants 
reacted poorly with the C33 (NS-3) peptide in the Ortho 
RIBA, and yielded two "indeterminate" results (donor nos. 
11 and 13) . 

Thus, current tests using Ortho RIBA and (to a lesser 
extent) Innogenetics LIA tests are unable to reliably 
detect HCV-2 and HCV-3 genotypes. For reliable testing 
for all HC7 types, antigens from S-1-1, ClOO and NS4 for 
each of the three types of HCV should preferably be 
included in the panel of antigens. 



•if 
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SEROLOGICAL REACnVTTY OF SERA FROM BLOOD DONORS 
INFECIID WTTH THREE TYPES OF HEPATITIS C VIRUS 



Doaor 
IStmber 


genotype 


ClOO 

0* At 


Ortho RXBA 
S-1-1 ClOO C33 C22 


Xtmogenetiea hXh 
NS4 Clt C2 C3 C4 


S*bl3 
B-bl5 
B-blfi 
B-bl7 
B*bl8 


1 


+ 
+ 


+ 
+ 


35 4 4 4 
4 4 4 4 
4 4 4 4 
4 4 4 4 
4 4 4 4 


25 3 1 2 1 1 
2 3 3 2 2 1 

2 3 2 3 3 - 

3 3 3 2 1 1 
3*211- 


B-b9 
S-blO 
B*blX 
B-bl2 


2 






13 4 
4 4 
* ^ 4 4 
4 4 


* - 3 113 
3 2 2 2 - 

- 3 4 2 2 3 

- 13 12 2 


B-bl 
E*b2 
E-*b3 
B-bS 
B-b7 


3 


+ 




4 4 
2 4 
2 4 


- 13 1-3 
^ 2X112 
2 2 1-2 2 1 

- 3 1 2 3 

- 2 3 114 



*Ortho HCV EUSA (Recombinant C 100-3) 

tAbbott HCV EIA (Hcpautis C Recombinant DNA Antigen) 

tCorc oligopeptides, 1-4 

§Band5 scored • (negative) to 4 (strong positive) according lo manufacturers instnictions. 
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PART ni. MAPPING OF ANTIGENIC DETERMINAN!fS IN N5-4 



introduction 

With an overall aim of improving serological 
screening assays, va have obtained sequence data from the 
antigenic region of region corresponding to clOO-3 for 
types 2 and 3. This information was used to epitope map 
the region f to define additional immunoreactive peptides 
that could be used to improve serological anti-HCV assays. 

Methods; 

PGR and secfuencincr . Plasma samples from Scottish blood 
donors yielding repeatedly reactive donations on 2nd 
generation anti«HCV screening (Abbott or ortho) / and which 
were confirmed or indeterminate on confirmatory testing by 
RIBA (Chiron) were referred to the Department of Medical 
Microbiology from the Scottish National Blood Transfusion 
Service Microbiology Reference laboratory. HC7 RNA within 
the plasma samples was extracted and amplified with 
primers in the 5 'NCR as described previously (Chan et al,, 
1992) . HCV was typed by sequence analysis of the 
amplified DNA as described previously (Simmonds et al,, 
1990) and by RFLP analysis. 

Five samples from different donors infected with HCV 
type 3 (nos. 40, 38, 36, 26 and 1787), four infected with 
type 2 (nos. 3S1,. 59, 940 and 810) and five with type 1 
infection (nos. 16, 42, 77, 1801 and 1825) were amplified 
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with primers corresponding to sense and ant i '-sense 
sequences spanning the antigenic region of NS-4 (table 
1) . Nucleotide sequences obtained from the amplified DNA 
were compared and used to define consensus sequences for 
each HCV type. In-frame translation of the nucleotide 
sequences yielded an uninterrupted consensus amino acid 
sequence that was used to define a series of overlapping 
oligopeptides for epitope mapping. 

Epitope mapping and determination of antibody specificities 

Overlapping synthetic peptides were synthesised on 
polypropylene pins using kits conusercially available from 
Cambridge Research Biochemicals Ltd. The principle of the 
addition reactions is described in refs (Geysen et al., 
1984; Geysen et al., 1985). Antibody reactions were 
carried out on pins disrupted by sonication (30 minutes) 
in 1% sodium dodecyl sulphate^ 0,1% 2-mercaptoethanol, 
0*1M sodium dlhydrogen orthophosphate* Pins were 
pre-coated in 1% ovalbumin, 1% bovine serum albumin, 0.1% 
Tween-20 in phosphate buffered saline (PBS) for one hour 
at room temperature. Semm or plasma was diluted 1*40 in 
PBS + 0.1% Tween-20 (PBST) and incubated with the blocked 
pins at 4*^C for 18 hours. After washing in 4 changes of 
PBST (10 minutes at room temperature, with agitation) , 
bound antibody was detected by incubation in a 1/20000 
dilution of affinity isolated anti-human IgG, peroxidase 
conjugate (Sigma) for one hour at room temperature- 
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Following washing (4 changes in PBST) , pins were incubated 
in a 0.05% solution of 

azino-di-3-ethyl-benzthiazodinsulphonate in o.i M sodium 
phosphate/sodium citrate buffer (pH 4.0) containing 0.03% 
hydrogen peroxide for 20 ininutes. Optical densities were 
read at 410 nm. 



HCV ENA in plasma saii«>les from five donors infected 
with HCV type 3 by sequence analysis of the 5 'NCR, and by 
RFLP were amplified in the NS-4 region using primers 
listed in Table 1. Because of the high degree of sequence 
variability in this region, it was necessary to use 
separate sense primers for the amplification of different 
HCV types. However, the anti-sense primers were in a 
highly conserved region and could be used for 
amplification of all three types. Sequence analysis was 
carried out as previously described. This gave a 
continuous sequence from position 4911 to 5271 (numbered 
as in Choo et al., 1991) (Figure 5a). Little sequence 
variability (highlighted) was observed between the four 
different donors in this region. 

The nucleotide sequences were used to deduce the 
sequence of the encoded peptide (Figure 5b) . The putative 
protein contains mainly hydrophUlic residues but no 
potential sites for H-linked glycosylation, Amino acid 
sequence variability with HCV type 3 was confined to only 
five residues (Figure 5b) . However, this region differed 
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considerably from the amino acid sequences of other blood 
donors infected with HCV types 1 and 2 (T16, 42, 77, 1801, 
1825, 351, 940 and 810; Figure 6a)* Sequence comparison 
between the major HCV types from residues 1679 to 1769 
reveals three regions of considerable amino acid sequence 
variability. Most of the observed differences between 
types involve non-synonymous amino acid substitutions, 
particularly alternation of acidic and basic residues in 
the hydrophillic regions. These changes would be expected 
to profoundly alter the overall conformation of the 
protein, and its antigenicity. 

The consensus amino acid sequences in this region of 
types 1*^3 (Figure 6b) were used to define three series of 
82 nonameric oligopeptides overlapping by eight of the 
nine residues with those before and after in the series 
(Figure 7a-c) . These were synthesised on a 12 x 8 arrays 
of polypropylene pins as described in Methods. Antibody 
reactivity to the Immobilised antigens on the pins was 
determined by indirect ELISA, using an overnight 
incubation with a 1/40 dilution of test serum overnight at 
4°c, followed by washing, and detection with an anti-human 
IgG-peroxidase conjugate and appropriate substrate (see 
Methods) . 

Reactivity of an anti-HCV negative, PCR-negative 
donor, with no Icnown risk factors for HCV infection with 
the three series of peptides was determined- No 
significant reactivity is shown with any of the 
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HCV-encoded oligopeptides. Reactivity of sera from three 
donors infected with HCV type 3 to each of the 
Oligopeptides is shown in Figures 9a-9c. All three sera 
reacted with peptides ranging from No.l3 (sequence 
KPALVPDKE; Figure 7) to Mo. 22 (sequence VLYQQYDEM) in the 
first antigenic region, although the precise peptides 
recognised varied slightly between individuals. All three 
sera reacted to vailing extents with a second antigenic 
region, lying in the range from oligopeptides 32 to 42 (of 
sequence ECSQAAPYI to QAQVIAHWQ) . Weaker and more 
variable reactivity was observed to peptides 48 to 53. 
Finally, significant reactivity was also observed to 
single oligonucleotides 2 C2 of 3 samples), 61 (2 of 3), 
66 (3 of 3) , 73 (3 of 3) and 80 (2 of 3) . 

•Che sequences of the major antigenic regions of HCV 
type 3 differ considerably from those encoded by any of 
the type 1 or type 2 variants- The region bounded by 
peptides 13 to 22 shows average homologies of 50% with HCV 
type 2 variants and 67% with type l. Between peptides 32 
to 42, there are homologies of 39% with type 2 and 58% 
with type 1 variants. Thus, although similar regions of 
each NS-4 sequence are antigenic, the actual epitopes 
differ considerably between HCV types. 



Discussion 

The NS-4 region of HCV type 3 shows considerable 
sequence divergence from other variants of HCV, that 
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exceeds that found in the core, NS-3 or NS-5 regions 
previously analysed (Chan et al, 1992). The function of 
the protein encoded by this region of the HCV genome is 
unknown, and the consequences of this variability on virus 
replication and pathogenesis are unknown. The fxinction of 
the NS*4 region in flaviviruses and pestiviruses is also 
poorly defined. 

The degree of amino acid sequence varieibility, and 
the nature of the amino acid substitutions indicate that 
the major sites of antibody reactivity are also those of 
antigenic variability. This undoubtedly underlies the 
restricted cross -reactivity of HCV type 1 NS«4 encoded 
gintigens with sera from individuals infected with 
different HCV types* Serological diagnosis of infection 
is currently based entirely on recombinant or synthetic 
oligopeptide sequences derived ultimately from HCV type l 
sequences (Choo et al., 1991)* The serological response 
to infection is often very restricted in its initial 
stages, with antibody to only one of the recombinant 
antigens used for screening* Not only does this present 
difficulties with supplementary antibody tests, where 
reactivity to two HCV-encoded antigens is required for 
confirmation, but can lead to an increased probability of 
failing to detect early infection with HCV types 2 and 3. 

Table 7 relates HCV typing determined by PCR to 
results obtained using type-specific antigens (TSA) and 
shows good correlation for HCVi-3 types. 
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TABLES 



SEQUENCES OF NS-4 ENCODED ANTIGENS FOR (A) IMPROVED 

SEROLOGICAL DIAGNOSIS, AND (B) FOR SEROLOGICAL 
DISCRIMINATION OF INFECTION WTIH DIFFERENT HCV TYPES 

A) 

ReeHin 2 a710-l728) 

ECSQAAPYIEQAQVIAHQF 

ECASKAALIEEGQRMAEML 
ECASRAALIEEG(^UAEML 



Reghm 2 (1710.1728) 

ECSQAAPYIEQAQVIAHQF 

ECASKAALIEEGQRMAEML 
ECASRAALIEEGQRIAEML 

ECSQHLPYEGMLAEQF 
ECSQHLPYIEGALAEQF 



* Amino add posidoos nnmbeied as in Choo et aL, (1991). _ 
T Amino add eodcs: A: alanine: R: armnine: N; aspaiadne: D: aspartic add; C: cysoane: Q: 
glotaininc- E: glutamic add: G: gfydnc: H: histidine; t isoleudnc; L: leudne: K: lysme: M: 
nuahionine: F: phenylalanine: P: proline: S: serine: T: threonine: W: tryptophan: Y: tyrosdae: 
V: valine. 

t Altemadve pqiddes. where these is variability wiriiin an HCV type,. 



Type Rfigfon 1 0<S»l-Vm)* 
3 KPALVPDREVLYCJQYDEMt 

2p RVWTPDKEILYEAEDEM 
RAVIAPDKEVLYEAFDEM 

a) 

TyjK Region I (1691-1708) 

3 KPALVPDKEVLYQQYDEM 

2 RWVTPDKEILYEAFDEM 
RAVIAPDKEVLYEAFDEM 

I KPAIIPDREVLYREFDEM 
KPAVIPDREVLYREFDEM 
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TABLE 6 

SEQUENCES OF OUGONUCLEOTIDES SUITABLE FOR DIRECT DEIECTION OF 
HCV TYPE 3 IN CLINICAL SPECIMENS BY POLYMERASE CHAIN REACTION 



Nmm Region Ptnition PQi.t Scquoices 5'-3't 



0O7 NS-4 5293 - AACTCGAGTATCCCACTGATGAAGTrcCACAT 

220 NS4 5278 - GACATGTGCTrCGCCCAGAA 

Type 3^1 

TS-3a NS-4 5140 + GCCGCCCCATATATCGAACA 

TS-3b NS^ 5161 + GCTCAGGTAATAGCCCACCA 

7)^2: 

TS-2a NS-4 5140 + AAAGCCGCCCTCATTGAGGA 

TS-2b NS-4 5161 + GGGCAGCGGATGGCGGAGAT 

TS-la NS-4 5140 + CACTTACCOTACATCGAGCA 

TS-lb NS-4 5161 + GGGATGATGCTCGCCGAGCA 



* Position of 5'base leiaiive to HCV genomic sequence in Choo et aL. (1991). 
t Oriemaiion of primer sequence (+: sense: -: ami-sense) 
t Abtveviations: A: adenine. C: cytiitine: G: gnanidine. T: Aymidine. 
TlVpe-specffic sense primers for amplification of HCV types 3. 2 and 1 variants. 
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TABLE 7 

COMPARISON OF SEROLOGICAL TYPING BY HCV-TSA WTIH PGR 



PGR* 



Number TYPE^EOFIC ANTIBODY 

tested 12 3 1+2 1+3 2f 3 NTS^ NR* 



57 63 - - - 1 - 3 3 



2 12-11 



3 47 



1 0 

1 . 45 - 2 - 4 4 



27 11 



•Gtmo^pe of HCV sequences amplified by PGR and typed by RFLP (McOmish etaL 1992) 

* NTS : No lype-spedfic antibody detected 

* NR : non-ieaciivB with NS-4 peptides 

* Samples fiom HCV-infected haemophiliacs, na-typcd by PGR. 
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PART IV IDENTIFICATION OF HCV TYPE -4 

Introduction 

Investigations were carried out on sequence 
variations in the 5' non-coding region (5 'NCR) of HCV 
samples from a variety of worldwide geographical locations 
(Figure 13), and also in the core region (Figures ISA and 
15B) . Phylogenetic analysis (Figures 14 and 16) revealed 
a new distinct HCV type which we refer to herein as HCV-4, 

Methods 

Samples * RNA was extracted from plasma samples that were 
repeatedly reactive on second generation screening assays 
for HCV, and which were either confirmed (significant 
reactivity with two or more antigens in the Chiron 
recombinant immxinoblot assay; Chiron Corporation, 
EmeiYville, California, USA) or indeterminate (reactive 
with only one antigen) from blood donors and patients with 
NANBH. Most of the samples containing sequences that 
differed substantially from known HCV types came from 
Egypt (Eg 1-33). Others came from Holland (NL-26) , Hong 
Kong (HK 1-4), Iraq (IQ-48) and XX (xx-(6) , 

Sequence determination , HCV sequences were reverse 
transcribed and amplified with primers matching conserved 
regions in the 5 'NCR as previously described [1]. For 
analysis of the core region, RNA was reverse transcribed 
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using a primer of sequence CA(T/C)GT(A/G)AGGGTATCGATGAC 
(5' bas^: xxst, numbered ^& in [20]). cDNA was amplified 
using this primer and a primer in the 5 'NCR of sequence 
ACTGCCTGATAGGGTGCTTGCGAG (5' base: -54). The second PCR 
used primers of sequences AGGTCTCGTAGACCGTGCATCATG (S' 
base: -21) and TTGCG (G/T/C) GACCT (A/T) CGCCGGGGGTC 
(5 'base: xx3C) - Amplified DNA in both regions was directly 
sequenced as described previously (reference la) * 

Sequence analysis. Sequences were aligned using the 
CKTSTAL program in the University of Wisconsin GCG package 
(reference 6) * Phylogenetic trees were constructed by the 
DNAHL program in the PHYLIP package of Felsenstein 
(version 3^4 ^ June 1991; (reference 9)^ using the global 
option. RHA secondary structures in the S'NCR of 4 
representative EC7 variants (refs) were predicted using 
the program FOLD* Three predictions were made from each 
sequence between nucleotides -341 to -1^ --341 to +300, and 
-341 to +900 to allow for possible long range interactions. 
Comparison of the predicted conformations for each 
sequence over the different lengths showed that only 
relatively small scale features, such as the stem/loop 
analysed in the Results were at all conserved (data not 
shown) . 

All sequences reported in this part have been 
submitted to GenBank. 
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Results 

Divergent S^NCR sequences. Several sequences in the 5'NC 
region detected in samples of blood donors from Saudi 
Arabia, Holland and Hong Kong, and from NANBH patients in 
Iraq and xxx differed substantially from those found in 
Scottish blood donors and those reported elsewhere (Figure 
13)* Instead of showing the well characterised nucleotide 
substitutions that distinguish HCV types 1, 2 and 3 from 
each other, a new set of sequence differences were 
observed in the new variants that appeared to place them 
outside the existing system of virus classification. This 
can be more simply represented by reconstructing a 
phylogeny of the sequences and presenting the results as 
an evolutionary tree (Figure 14) • This analysis confirms 
that sequences 1-10 cluster separately from the variants 
previously typed as 1,2 and 3. For convenience we will 
refer to sequences within this new group as type 
HCV type 4. Mean distances within type 4 and between type 
4 and the other HCV types in the 5 'NCR were comparable to 
those previously described for type 1-3* Although 
sequences within type 4 are relatively closely grouped, 
sequences 11, 12 and 13 differ considerably from any of 
the known types. 

Using this phylogenetic tree, it can be seen that the 
majority of previously published 5 'NCR sequences can be 
readily identified as types 1, 2 or 3, Furthermore, 
almost all of the sequences from Zaire (shown as hollow 
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squares) cluster closely within type 4, suggesting a wider 
distribution in Africa* However, a further complication 
is that three identical sequences obtained from South 
African patients appeared distinct from both the type 1 
cuid the type 4 group, and may represent yet another HCV 
type. 

RNA from three repres^tative type 4 variants (Eg 29, 
33, 21; corresponding to S'MCR sequences nos. 1-3) was 
amplified using primers in the core region of HCV 
polyprotain« All three sequences differed considerably at 
both the nucleotide and amino acid level from HCV types 1 
to 3 (Figure 15A/B) . Phylogenetic analysis of these 
sequences and those previously analysed indicated that 
th^y fonned a separate, relatively homogeneous group 
distinct from the other types (Figure 16) • Reconstructed 
nucleotide distances between type 4 and types 1-3 were 
comparable to those that exist between the three known HCV 
types of HCV. Although most of the nucleotide sequence 
differences were silent, there were between 4 and 9 amino 
acid differences between the new variants and other types. 
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PART V HCV TYPING 
Introduction 

In view of the saqaence variations between HCV types 
1,2,3 and 4 differences in restriction enzyme cleavage sites 
exist, leading to different endonuclease cleavage patterns. 
This technique was used to identify HCV genotypes in blood 
samples from a variety of sources worldwide. 

(A) Typing of HCVl-3 
METHODS 

Serum Samples ; Samples from blood donors in six countries, 
Scotland^ Finland, Netherlands, Hong Kong and Australia and 
Japan, were availcible from routine 2nd Generation anti-*HCV 
EUSA screening (Ortho or Abbott) . Donor samples that were 
repeatedly reactive in the above tests were further 
investigated using a supplementary test (Ortho RIBA; 
Finland, Netherlands, Australia, Egypt, Abbott Matrix: Hong 
Kong) or samples were titred for anti-HCV by ELISA (Japan) • 
Samples that were positive (significant reactivity with two 
or more HCV antigens (1+ to 4+) or indeterminate (reactivity 
with one antigen only) in the RIBA test or had a titre of > 
X 4096 by ELISA (Japan only) were tested for viral SNA by 
Polymerase Chain Reaction (PGR) . 

RNA PGR : PCR for the detection of HCV RNA was carried out as 
previously described by Chan et al (reference la) using 
primers in the 5 'non-coding region (5 'NCR) in a nested PGR, 
with primers 209/939 and 211/940(5) in first and second 
reactions respectively. 
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HCV TYPING 

The existence of relatively conserved patterns of 
substitutions in the 5 'NCR that are characteristic of 
different HC7 types provide useful signature sequ^ces for 
identification of HCV genotypes. Having compared large 
numbers of different HCV t^pa 1, 2 and 3 sequences, we 
developed a method that differentiated HCV types 1-3 by 
restriction endonuclease cleavage of amplified DNA. 
However, the 19 type 4 sequences would appear as type 1 
(electrophoretic types Aa and Kb) , and for concurrent 
studies it has been necessary to modify the conditions to , 
identify the new HCV type. All type 4 sequences showed a 
T — >C change at position -167 that creates a novel HinfX 
site that is absent in all type 1 (and type 2) sequences. 
In combination with ScrFI, and Haelll/Rsal, it has now 
proved possible to identify the new type reliably in 
numerous countries in the Middle East and elsewhere. 

RESULTS 

The results are summarised in Table 8 for HCV types 
1, 2 and 3. The Egyption samples gave abberant 
restriction patterns on the single ScrFI digest and were 
identified as type 4, 



wo 93/10239 



PCr/GB92/02143 



- 49 - 



TABLE 8 



PREVALENCE OF HCV TYPES IN DIFFERENT COUNTRIES 



COUNTRY 



HCV TYPES f%) 
HCV-1 HCV-2 



HCV-3 



Scotland 



86(51%) 21(13%) 60(36%) 



Finland 



3(25%) 



5(42%) 4(33%) 



Netherlands 



18(60%) 



7(23%) 5(17%) 



Hong Kong 



22(63%) 0(0%) 



0(0%) 



Australia 



13(57%) 3(13%) 7(30%) 



Japan 



31(77%) 9(23%) 



0(0%) 



Egypt 



0(0%) 0(0%) 



0(0%) 
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(B) MODIFICATION OF PCR-BASED TYPING ASSAY TO DETECT 
INFECTION Wrm HCV TYPE 4 IN CLINICAL SPECIMENS 

Methods 

Extraction of HMA > RNA was extracted from 100 ul aliquots 
of plasma of non-A, non-'B patients by addition of 1 ml 
BHAzol solution (2H guanidinium thioc^anate, 12.5 mM 
sodiiim citrate [pH7.0J;^ 0*25% w/v N-iauroylsarcosine^ 
0.05M 2-mercaptoethanol, lOOmH sodium acetate [pH 4.0]^ 
50% w/v water saturated phenol) as previously described 
(Chomczynski et al, 1987) , and mixed until precipitate 
dissolved,. After addition of 100 ul chloroform, each 
sample was spim for 5 minutes at 14000 x g and the aqueous 
phase re-extracted with 0.5 ml chloroform. ENA was 
precipitated by addition of an equal volume of isopropanol 
and incubation at -2oOc for at least 1 hour. An laiA 
pellet was produced by centrifugation at 14000 x g for 15 
minutes at 4*='C, washed in iml 70% cold ethemol solution^ 
dried and resuspended in 20 ul diethylpyrocarbonate 
treated distilled water- of the 100 directly extracted 
samples, a total of 19 were PCR-negative (see below) . Two 
ml volumes of the negative samples were ultracentrifuged 
at 200 000 x g for 2 hours and the pellet re-extracted as 
described above* Extraction from the larger volume of 
plasma yielded an additional 3 positive samples (numbers 
66, 80, 85)* 
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PCR and typing > RNA was reverse transcribed with primer 
940 and cDNA amplified in a two stage nested PGR reaction 
with primers 940/939, followed by 209/211 as previously 
described (Chan et al. 1992). PCR product was 
radiolabelled with [35s]-dATP analysed by restriction 
endonuclease cleavage (McOmish et al. Transfusion, 
32: no. 11 1992), Samples were cleaved with ScrFI and a 
combination of Haelll/Rsal in two separate reactions to 
identify HCV types 1/4, 2, 3. Figure 17 shows 
endonuclease cleavage patterns. HCV types 1 and 4 were 
differentiated by a third reaction with Hinfl (see 
Results) » Two samples yielded restriction patterns that 
were different from those of the four known types of HCV 
and were analysed further by direct sequence analysis of 
the amplified DNA (Chan et al» 1992)* These two samples 
contained 5 'NCR sequences distinct from those of known HCV 
types and currently remain unclassified. 

RESULTS 

Modification of RFLP method to identify HCV type 4. 
Previous sequence analysis in the 5 'NCR of HCV amplified 
from plasma of Egyptian blood donors revealed a relatively 
homogeneous group of novel secpience variants in both the 
5 'NCR and core region which were as distinct from HGV 
types 1, 2 and 3 as these latter types were from each 
other (see previous submission) . This new group was 
designated as HCV type 4. 
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Comparison of cleavage patterns of type 4 sequences 
witn those of type RFLP analysis of the previously 
identified type 4 sequences produced a distribution of 
electropherotypes with ScrFI and Haelll/Rsal similar to 
that HCV type 1 (Table 9) . Type 1 sequences yielded 9 
patterns of aA/B, 35 of bA/B and 1 bC. With these enzymes 
alone, type 4 sequences were thus indistinguishable from 
type 1 (14 aA/B, 4 hft/B) • However, type 1 and type 4 
sequences consistently differ in the number of Hinf I 
sites. All 18 type 4 sequences contain one or two 
potential cleavage sites (producing patterns band c; table 
B) while none are found in any of the 45 type 1 sequences 
analysed (pattern a) . One of the type 4 sequences was 
further differentiated from type 1 and other HCV types by 
the loss of a restriction site for Rsal, leading to a new 
pattern of bands designated h (44, 172, 9, 26; first 
column. Table 9) . Finally, a single sequence, EG-28 lost 
two sites to produce bands of 216, 9, and 26 bps (pattern 
i; Table 9) . This sequence was distinct from that of any 
of the known HCV l^es (including type 4) and is shown in 
the table in the column labelled XJ (unclassified) 

Typing of study sublects. RNA was extracted from 100 
samples of patients with NANB hepatitis and amplified with 
primers in the 5 'NCR. Of these, 84 were PGR positive, and 
enabled HC\r l^ing to be cairried out by RFLP. This was 
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initially carried out with Haelll/Rsal and ScrFl, and 
allowed the identification of 10 type 2 and 10 type 3 
variants (Table 10) . Samples showing electrophoretic 
patterns aA/B or bA/B were further analysed by cleavage 
with hinfl, yielding 38 samples with pattern a, thus 
identified as type 1, 22 with pattern b and 2 with pattern 
both identified as type 4* Finally, two samples showed 
the unusual cleavage patterns h and i with Haelll/Rsal and 
pattern b with Hinfl, and were therefore directly 
sequenced. These two sequences were similar to each other 
but were unlike any of the known HCV type^, and also 
distinct from EG-28, the other sequence showing pattern i 
with Haelll/Rsal (Table 10) . As they cannot be currently 
classified, they will be referred to as type 
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TABLE 9 



PREDIOED CLEAVAGE PATTERNS OF PUBLISHED 5'NCR SEQl^CES 
OP BCV TYPES 1. 2, 3 AND 4 WIIH RsaUHaeM. ScrFl AND Hm 



Fndicted deavage pattern' HCVQrpe 

1 2 3 4 0' 



IttMul ScrFl Hinfl 



a 



a 

a 

b A/B b 



c 



d Da 



e 
e 



E a 



A« a« 9 

b AAB a 35 

b C a 1 



Am b" - - - 13 

AAB e ' ' ' \ 

_ - - 4 



D a - 5 



1 



E a - 2 

D a - 1 



1 



f G b . - 14 

f G c - - 1 

g G b - - 8 

MB b - - - 

i» A/B b - - - 



' Cteawge patteins designaied fbrHfldII//2iflI and SarFl as described previously (McOmish 
1992). 

" Cleavage panem of an HCV variant of nndesigned qrpe 
^ FisilBm a : ondeaved by Ho^ 

*PaiiBnib:DNAclcaved to generate two fiagments of sizes 107 and 142 bps (m order 5 -> 
3*) 

• panem c : DNA cleaved to geneiaue three fiagments of 56, 51 and 142 bps 
'Newdcavage pattern ioLHtKmiRsOL designated h (bands of 44 bps, 172 bps. 9 bps, 26 bps) 
» New cleavage pattern for HaeBURsdL designated i {216 bps. 9 bps. 26 bps) 
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TABLE 10 

IDENTTHCA-nON OF HCV TYPES 1-4J^STUDY SUBJECTS BY RJ^ 
ZiALYSlS OF 5'NCR SEQUENCES WITH RsdUHaem. ScrFl AND W 



/8mI 


cleavage 

SerFl 


! pattern 

Hinfl 


Inferred HCV type 
12 3 4 




a 
b 


m 

A/B 


a 
a 


2 
36 


- 


- 


- 


a 
a 
b 


A/B 
A/B 
A/B 


b 
c 
b 






16 

2 
6 


- 

— — = 


c 
d 


D 

E 


ILd. 

ILd. 


7 
3 








f 
S 


G 
G 


SLd 

iLd. 




7 
3 






h 

* 

1 


A/B 
A/B 


b 
b 








I 
1 






TOTALS 


38 10 


10 


24 


2 



• Two samples yielded un««ial ie«riction paoems ^ HaemRsdi (h. ^^^J^' 
of Se 5'NCR placed them outside ejdsting HCV classification (samples ICM8, EG-%). 
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EMlg VX Expression and Assay etc. Techniques 

The present invention also provides expression 
vectors containing the DNA sequences as herein defined, 
which vectors being capable/ in an appropriate host^ of 
expressing the DNA sequence to produce the peptides as 
defined herein. 

The expression vector normally contains control 
elements of DNA that effect expression of the OKA sequence 
in an appropriate host^ These elements may vary according 
to the host but usually include a promoter, ribosome 
binding site, translatlonal start and stop sites, and a 
transcriptional termination site. Examples of such 
vectors include plasmids and viruses. Expression vectors 
of the present invention encompass both extrachromosomal 
vectors and vectors that are integrated into the host 
cell's chromosome. For use in E^coli, the expression 
vector may contain the DHA sequence of the present 
invention optionally as a fusion linked to either the 5'- 
or 3 '-end of the DNA sequence encoding, for example, 
B-galactosidase or to the 3 '-end of the DHA sequence 
encoding, for example, the trp £ gene. For use in the 
insect baculovirus (AcNPV) system, the DHA sequence is 
Optionally fused to the polyhedrin coding sequence. 

The present invention also provides a host cell 
transformed with expression vectors as herein defined. 

Examples of host cells of use with the present 
invention include proJcaryotic and eukaryotic cells, such 
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aa bacterial, yeast, mammal ian and insect cells* 
Particular examples of such cells are E.coli. 
S.cerevisiae. P>pastoris, Chinese hamster ovary and mouse 
cells, and Spodoptera fruaiperda and Tricoplusia ni^ The 
choice of host cell may depend on a number of factors but, 
if post-translational modification of the HCV viral 
peptide is important, then an eulcaryotic host would be 
preferred* 

The present invention also provides a process for 
preparing a peptide as defined herein which comprises 
isolating the DNA sequence, as herein defined, from the 
HCV genome, or synthesising DNA sequence encoding the 
peptides as defined herein, or generating a DNA sequence 
encoding the peptide, inserting the DNA sequence into an 
expression vector such that it is capable, in an 
appropriate host, of being expressed, transforming host 
cells with the expression vector, culturing the 
transformed host cells, and isolating the peptide* 

The DNA sequence encoding the peptide may be 
synthesised using standard procedures (Gait, 
Oligonucleotide Synthesis; A Practical Approach , 1984, 
Oxford, IRL Press) . 

The desired DNA sequence obtained as described above 
may be inserted into an expression vector using known and 
standard techniques* The expression vector is normally 
cut using restriction enzymes and the DNA sequence 
inserted using blunt-end or staggered-end ligation. The 
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cut is usually made at a restriction site in a convenient 
position in th& expression vector such that, once 
inserted^ the DNA sequences are under the control of the 
functional elements of DNA that effect its expression. 

Transformation of an host cell may be carried out 
using standard techniques. Some phenotypic marker is 
usually employed to distinguish between the transformants 
that have successfully taken up the expression vector and 
those that have not. Culturing of the transformed host 
cell and isolation of -Oie peptide as required may also be 
carried out using standard techniques. 

ThB peptides of the present invention may be prepared 
by synthetic methods or by recombinant OUA technology. 
Xhe peptides are prefercibly synthesized using automatic 
synthesizers. 

Antibody specific to a peptide of the present 
invention can be raised using the peptide. The antibody 
xoay be polyclonal or monoclonal, ^e antibody may be used 
in quality control testing of batches of the peptides; 
purification of a peptide or viral lysate; epitope 
mapping; when labelled, as a conjugate in a competitive 
type assay, for antibody detection; and in antigen 
detection assays. 

Polyclonal antibody against a peptide of the present 
invention may be obtained by injecting a peptide, 
optionally coupled to a carrier to promote an immune 
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response, into a xnammalian host, such as a mouse, rat, 
shaep or rabbit, and recovering the antibody thus 
produced. The peptide is generally administered in the 
form of an injectable formulation in which the peptide is 
admixed with a physiologically acceptable diluent. 
Adjuvants, such as Freund's complete adjuvant (FC!A) or 
Freund's incomplete adjuvant (FIA) , may be included in the 
formulation. The formulation is normally injected into 
the host over a suitable period of time, plasma samples 
being taken at appropriate intervals for assay for 
anti-HCV viral antibody. When an appropriate level of 
activity is obtained, the host is bled. Antibody is then 
extracted and purified from the blood plasma using 
standard procedures, for example, by protein A or 
ion*exchange chromatography. 

Monoclonal antibody against a peptide of the present 
invention may be obtained by fusing cells of an 
immortalising cell line with cells which produce antibody 
against the viral or topographically related peptide, and 
culturing the fused immortalised cell line. Typically, a 
non^human mammalian host, such as a mouse or rat, is 
inoculated with the peptide. After sufficient time has 
elapsed for the host to mount an antibody response, 
antibody producing cells, such as the splenocytes, are 
removed* Cells of an immortalising cell line, such as a 
mouse or rat myeloma cell line, are fused with the 
antibody producing cells and the resulting fusions 
screened to identify a cell line, such as a hybridoma, 
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that secretes the desired monoclonal antiiody. The fused 
cell line may be cultured and the monoclonal antibody 
purified from the culture media in a similar manner to the 
purification of polyclonal antibody. 

Diagnostic assays based upon the present invention 
may be used to determine the presence or absence of HCV 
infection. OSxey may also be used to monitor treatment of 
such infection, for example in interferon therapy. 

Ill an assay for the diagnosis of viral infection, 
there are basically three distinct approaches that can be 
adopted involving the detection of viral nucleic acid, 
viral antigen or viral antibody. Viral nucleic acid is 
generally regarded as the best indicator of the presence 
of the virus itself and would identify materials likely to 
be infectious. However, the detection of nucleic acid is 
not usually as straightforward as the detection of 
antigens or antibodies since the level of target can be 
very low. Viral antigen is used as a marker for the 
presence of virus and as an indicator of infeetivity. 
Depending upon the virus, the amount of antigen present in 
a sample can be very low and difficult to detect. 
Antibody detection is relatively straightforward because, 
in effect, the host immune system Is amplifying the 
response to an infection by producing large amounts of 
circulating antibody. The nature of the antibody response 
can often be clinically useful, for example XgM rather 
than IgG class antibodies are indicative of a recent 
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infection, or the response to a particular viral antigen 
may be associated with clearance of the virus. Thus the 
exact approach adopted for the diagnosis of a viral 
infection depends upon the particular circiamstances and 
the information sought. In the case of HCV, a diagnostic 
assay may embody any one of these three approaches* 
In an assay for the diagnosis of HCV involving 
detection of viral nucleic acid, the method may comprise 
hybridising viral RNA present in a test sample, or cDNA 
synthesised from such viral RNA, with a DNA sequence 
corresponding to the nucleotide sequences of the present 
invention or encoding a peptide of the invention, and 
screening the resulting nucleic acid hybrids to identify 
any HCV viral nucleic acid. The application of this 
method is usually restricted to a test sample of an 
appropriate tissue, such as a liver biopsy, in which the 
viral RNA is likely to be present at a high level. The 
DNA sequence corresponding to a nucleotide sequence of the 
present invention or encoding a peptide of the invention 
may take the form of an oligonucleotide or a cDNA sequence 
optionally contained within a plasmid. screening of the 
nucleic acid hybrids is preferably carried out by using a 
labelled DNA secpience. Preferably the peptide of the 
present invention is part of an oligonucleotide wherein 
the label is situated at a sufficient distance from the 
peptide so that binding of the peptide to the viral 
nucleic acid is not interfered with by virtue of the label 
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being too close to the binding site, one or more 
aaaitional rounds of screening of one kind or another may 
be carried out to cdiaracterise further the hybrids and 
thus identify any HC\r viral nucleic acid- The steps of 
hybridisation and screening are carried out in accordance 
with procedures known in the art* 

The present invention also provides a test kit for 
the detection of HCV viral nucleic acid, which comprises 
i) a labelled oligonucleotide comprising a DNA sequence 

of the present invention or encoding a peptide of the 

present invention; and 
il) washing solutions, reaction buffers and a substrate, 

if the label is an enzyme. 

Advantageously, the test kit also contains a positive 
control sample to facilitate in the identification of 
viral nucleic acid. 

In an assay for the diagnosis of HCV involving 
detection of viral antigen or antibody, the method may 
comprise contacting a test sample with a peptide of the 
present invention or a polyclonal or monoclonal antibody 
against the peptide and determining whether there is any 
antigen-antibody binding contained within the test sample. 
For this purpose, a test kit may be provided comprising a 
peptide, as defined herein, or a polyclonal or monoclonal 
antibody thereto and means for determining whether there 
is any binding with antibody or antigen respectively 
contained in the test sample • The test sample may be 
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taken from any of the appropriate tissues and 
physiological fluids mentioned above for the detection of 
viral nucleic acid. If a physiological fluid is obtained, 
it may optionally be concentrated for any viral antigen or 
antibody present. 

A variety of assay formats may be employed. The 
peptide can be used to capture selectively antibody 
against HCV from solution ^ to label selectively the 
antibody already captured, or both to capture and label 
the antibody, in addition, the peptide may be used in a 
variety of homogeneous assay formats in which the antibody 
reactive with the peptide is detected in solution with no 
separation of phases. 

The types of assay in which the peptide is used to 
capture antibody from solution involve immobilization of 
the peptide on to a solid surface. This surface should be 
capable of being washed in some way. Examples of suitable 
surfaces include polymers or various types (moulded into 
microtitre wells; beads; dipsticks of various types; 
aspiration tips; electrodes; and optical devices), 
particles (for example latex; stabilized red blood cells; 
bacterial or fungal cells; spores; gold or other metallic 
or metal-containing sols; and proteinaceous colloids) with 
the usual size of the particle being from 0.02 to 5 
microns, membranes (for example of nitrocellulose; paper; 
cellulose acetate; and high porosity/high surface area 
membranes of an organic or inorganic material) . 
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The attachment of the peptide to the surface can be 
by passive adsoption from a solution of optimum 
composition which may include surfactants^ solvents ^ salts 
and/or chaotropes; or by active chemical bonding • Active 
bonding may be through a variety of reactive or 
activatible functional groups which may be exposed on the 
surface (for exaii4>le condensing agents; active acid 
esters^ halides and anhydrides; amino, hydroxyl, or 
carbos^l groups; sulphydryl groups; carbonyl groups; 
diazo groups; or unsaturated groups). Optionally, the 
active bonding may be through a protein (itself attached 
to the surface passively or through active bonding) , such 
as albumin or casein, to which the viral peptide may be 
chemically bonded by any of a variety of methods. The use 
of a protein in this way may confer advantages because of 
isoelectric point, charge, hydrophilicity or other 
physico'-'ChemicaX property. The viral peptide may also be 
attached to the surface (usually but not necessarily a 
membrane) following electrophorectic separation of a 
reaction mixture, such as immunoprecipitation* 

After contacting (reacting) the surface bearing the 
peptide with a test sample, allowing time for reaction, 
and, where necessary, removing the excess of the sample by 
any of a variety of means, (such as washing, 
centrifugation, filtration, magnetism or capilliary 
action) the captured antibody is detected by any means 
which will give a detectable signal* For example, this 
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may be achieved by use of labelled molecule or particle as 
described above which will react with the captured 
antibody (for example protein A or protein G and the like; 
anti-species or anti-immunoglobulin-sub-type? rheumatoid 
factor; or antibody to the peptide, used in a competitive 
or blocking fashion) , or any molecule containing an 
epitope contained in the peptide. 

The detectable signal may be optical or radioactive 
or physico-^chemical and may be provided directly by 
labelling the molecule or particle with, for example, a 
dye, radiolabel, electroactive species, magnetically 
resonant species or fluorophore, or indirectly by 
labelling the molecule or particle with an enzyme itself 
capable of giving rise to a measurable change of any 
sort. Alternatively the detectable signal may be obtained 
using, for example, agglutination, or through a 
diffraction or birefringent effect if the surface is in 
the form of particles. 

Assays in which a peptide itself is used to label an 
already captured antibody require some form of labelling 
of the peptide which will allow it to be detected. The 
labelling may be direct by chemically or passively 
attaching for example a radio label, magnetic resonant 
species, particle or enzyme label to the peptide; or 
indirect by attaching any form of label to a molecule 
which will itself react with the peptide* The chemistry 
of bonding a label to the peptide can be directly through 
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a moiety already present in the peptide, such as an amino 
group, or through an intermediate moiety, such as a 
maleimide group, capture of the antibody may be on any of 
the surfaces already mentioned by any reagent including 
passive or activated adsorption which vrill result in 
specific antibody or immune complexes being bound. In 
particular, capture of the antibody could be by 
anti-species or anti-immunoglobulin-^sub-type, by 
rheumatoid factor, proteins A, G and the liJce, or by any 
molecule containing an epitope contained in the peptide. 

The labelled peptide may be used in a competitive 
binding fashion in which it3 binding to any specific 
molecule on any of the surfaces exemplified above is 
blocked by antigen in the sample* Alternatively, it may 
be used in a non--competitive fashion in which antigen in 
the sample is bound specifically or non-specifically to 
any of the surfaces above and is also bound to a specific 
bi- or poly-valent molecule (e.g. an antibody) with the 
remaining valencies being used to capture the labelled 
peptide. 

Often in homogeneous assays the peptide and an 
antibody are separately labelled so that, when the 
antibody reacts with the recombinant peptide in free 
solution, the two labels interact to allow, for example, 
non-radiative transfer of energy captured by one label to 
the other label with appropriate detection of the excited 
second label or quenched first label (e.g. by fluorimetry. 
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magnetic resonance or enzyme measurement) • Addition of 
either viral peptide or antibody in a sample results in 
restriction of the interaction of the labelled pair and 
thus in a different level of signal in the detector. 

A suitable assay format for detecting HCV antibody is 
the direct sandwich enzyme immunoassay (EIA) format. A 
peptide is coated onto microtitre wells* A test sample 
and a peptide to which an enzyme is coupled are added 
simultaneously* Any HCV antibody present in the test 
sample binds both to the peptide coating the well and to 
the enzyme-coupled peptide* Typically, the same peptide 
are used on both sides of the sandwich. After washing, 
bound enzyme is detected using a specific siibstrate 
involving a colour change. A test kit for use in such an 
EIA comprises: 

(1) a peptide^ as herein defined labelled with an enzyme; 

(2) a substrate for the enzyme; 

(3) means providing a surface on which a peptide is 
immobilised; and 

(4) optionally, washing solutions and/or buffers. 

It is also possible to use IgG/IgM antibody capture 
ELISA wherein an antihuman antibody is coated onto 
microlitre wells, a test sample is added to the well- Any 
IgG or IgM antibody present in the test sample will then 
bind to the anti-human antibody. A peptide of the present 
invention, which has been labelled, is added to the well 
and the peptide will bind to any IgG or IgM antibody which 
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has resulted due to infection by HCV. The IgG or IgM 
antibody can be visualized by virtue of the label on the 
peptide. 

It can thus be seen that the peptides of the present 
invention may be used for the detection of HCV infection 
in many formats, namely as free peptides, in assays 
including classic ELISA, competition ELISA, membrane bound 
EIA and immunopreclpitation. Peptide conjugates may be 
used in aa^lified assays and IgG/IgM antibody capture 
ELISA. 

An assay of the present invention may be used, for 
example, for screening donated blood or for clinical 
purposes, for example, in the detection and monitoring of 
HCV infections. For screening purposes, the preferred 
assay formats are those that can be automated, in 
particular, the microtitre plate format and the bead 
format. For clinical purposes, in addition to such 
formats, those suitable for smaller-scale or for single 
use, for example, latex assays, may also be used. For 
confirmatory assays in screening procedures, antigens may 
be presented on a strip suitable for use in Western or 
other immunoblotting tests. 

As indicated above, assays used currently to detect 
the presence of anti-HCV antibodies in test samples, 
particularly in screening donated blood, utilise antigenic 
peptides obtained from HIV type 1 only and, as 
demonstrated herein, such antigens do not reliably detect 
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Other HCV genotypes- Accordingly, it is clearly desirable 
to supplement testing for HIV-1 with testing for all other 
^ genotypes, for example, types 2, 3 and 4, and also any 

further genotypes that may be discovered. 

To test for a spectrum of genotypes, there may be 
provided a series of assay means each comprising one or 
more antigenic peptides from one genotype of HCV, for 
example, a series of veils in a microtitre plate, or an 
equivalent series using the bead format. Such an assay 
format may be used to determine the genotype of HCV 
present in a sample* Alternatively, or in addition, an 
assay means may comprise antigenic peptides from more than 
one genotype, for example, a microwell or bead may be 
coated with peptides from more than one genotype* 

It has been found advantageous to use more than one 
HCV antigen for testing, in particular, a combination 
comprising at least one antigenic peptide derived from the 
structural region of the genome and at least one antigenic 
peptide derived from the non-structural region, especially 
a combination of a core antigen and at least one antigen 
selected from the NS3, NS4 and NS5 regions. The wells or 
beads may be coated with the antigens individually* It 
has been found advantageous, however, to fuse two or more 
antigenic peptides as a single polypeptide, preferably as 
a recombinant fusion polypeptide. Advantages of such an 
approach are that the individual antigens can be combined 
in a fixed, predetermined ratio (usually equimolar) and 
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that only a single polypeptide needs to be produced, 
purified and characterised, one or more such fusion 
polypeptides may be used in an assay, if desired in 
addition to one or more unfUsed peptides. It will be 
appreciated that there are many possible combination© of 
antigens in a fusion polypeptide , for example, a fusion 
polypeptide may con^rlse a desired raonge of antigens from 
one serotype only, or may comprise antigens from more than 
one serotype* The antigenic peptides from serotypes 2, 3 
and 4 are preferably those described herein. 

To obtain a polypeptide comprising multiple peptide 
antigens, it is preferred to fuse the individual coding 
sequences into a single open reading frame. The fusion 
should, of course, be carried out in such a manner that 
the antigenic activity of each component peptide is not 
significantly compromised by its position relative to 
another peptide* Particular regard should of cotirse be 
had for the nature of the sequences at the actual junction 
between the peptides. The resulting codiing sequence can 
be expressed, for example, as describad above in relation 
to recombinant peptides in general. The methods by which 
such a fusion polypeptide can be obtained are known in the 
art, and the production of a recombinant fusion 
polypeptide comprising multiple antigens of a strain of 
HCV type 1 is described in GB-A-2 239 245 
immunoprecipitation. Peptide conjugates may be used in 
amplified assays and IgG/IgM antibody capture ELISA. 
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The peptide of the present invention may be 
incorporated into a vaccine formulation for inducing 
immunity to HCV in man» For this purpose the peptide may 
be presented in association with a pharmaceutically 
acceptable carrier* 

For use in a vaccine formulation, the peptide may 
optionally be presented as part of an hepatitis B core 
fusion particle, as described in Clarke et al f Nature, 
1907, 330 , 381-384), or a polylysine based polymer, as 
described in Tam ( PNA5 . 1988, 85, 5409-5413). 
Alternatively, the peptide may optionally be attached to a 
particulate structure, such as lipsomes or ISCOMS. 

Pharmaceutically acceptable carriers include liquid 
media suitable for use as vehicles to introduce the 
peptide into a patient. An example of such liquid media 
is saline solution* The peptide may be dissolved or 
suspended as a solid in the carrier. 

The vaccine formulation may also contain an adjuvant 
for stimulating the immune response and thereby enhancing 
the effect of the vaccine. Examples of adjuvants include 
aluminium hydroxide and aluminium phosphate. 

The vaccine formulation may contain a final 
concentration of peptide in the range from 0.01 to 
5 mg/ml, preferably from 0.03 to 2 mg/ml. The vaccine 
formulation may be incorporaterd into a sterile container, 
which is then sealed and stored at a low temperature, for 
example 4^G, or may be free^e-dried. 
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In order to induce imiunity in man to HCV, one or 
more doses of the vaccine formulation may be 
administered. Each dose may be o.i to 2 ml, preferably 
0.2 to 1 ml. A method for inducing immunity to in 
man, comprises the administration of an effective amount 
of a vaccine formulation, as hereinbefore defined. 

The present invention also provides the use of a 
p^tide as herein defined in the preparation of a vaccine 
for use in the induction of immunity to HCV in man. 

vaccines of the present invention may be administered 
by any convenient method for the administration of 
vaccines including oral and parenteral (e.g. intravenous, 
subcutaneous or intramuscular) injection. The treatment 
may consist of a single dose of vaccine or a plurality of 
doses over a period of time. 
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SEQUENCE LISTING /FIGURE 1 and la) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide cDNA sequence 



SEQUENCE LENGTH : 194 base pairs 



STRANDEDNESS : single 



TOPOLOGY 



linear 



MOLECULAR TYPE : genomic DNA from HCV types 1-3 



ORIGINAL SOURCE 



ORGANISM 



: human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases - 255 to -62 of the 5' non-coding region showing 
variations in cDNA sequence between HCV types 1-3, 
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SEQUENCE J T<zn^rna (FIGimE 3) 



SEQ ID NO 



SEQUENCE 



TYPE : deduced peptide sequence 



SEQUENCE LENGTH : 85 amino acids 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE t HCV peptide 



ORIGINAL SOURCE 



ORGANISM 



: human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 2648 to 2732 of NS-5 region showing variations 
in peptide sequence between HCV types 1-3. 
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SEQUENCE LISTING fFIGUHE 5) 



SEQ ID NO 



SEQUENCE TYPE : deduced peptide sequence 



SEQUENCE LENGTH : 57 amino acids 



STRANDEDNESS 



single 



TOPOLOGY 



: linear 



MOLECULAR TYPE : HCV peptide 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 1577 to 1633 of NS-3 region showing variations 
in peptide sequence between HCV types 1-3 • 



wo 93/10239 



PCr/GB92/02143 



fiE QDENCE I T<;TTWG (FIGURE 7) 



SEQ ID NO 



SEQUENCE TYPE : deduced peptide sequence 



SEQUENCE LENGTH : 124 amino acids 



STSANDEDNESS 



single 



TOPOLOGY 



r linear 



MOLECOIAR TYPE : HC7 peptide 



nPTCTWAL SOURCE 



ORGANISM 



: human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 5 to 128 of core region showing variations in 
peptide sequence between HCV types 1-3, 
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SEQUENCE LISTING (FIGURE 9a) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide sequence 



SEQUENCE LENGTH ; 367 base pairs 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : HCV genomic DNA 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases 4911 to 5277 of NS-4 region of HCV-3 showing 
individual variations and consensus sequence. 



- PCr/GB92/02143 

WO93/1023!> 

- 84 - 

fiK QUEWCE LT g-PTWA fPIGTOE 9b > 
SEQ ID NO : 6 

SEQUENCE TYPE : deduced peptide sequence 
SEQUENCE LENGTH : X28 amino acids 
STRANDEDNESS : single 
TOPOLOGY : linear 

MOLECULAR TYPE : HCV genomic DNA 

OPTGTMAL SOURCE 

ORGANISM : human blood samples 

USE 

DEPOSIT • 
FEATURES 

Amino acids 1638 to 1765 of NS-4 region of HCV-3 showing 
individual variations and consensus sequence. 
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SEQUENCE LISTING (FIGURE 10a and 10b) 



SEQ ID NO 



SEQUENCE TYPE : peptide sequence 



SEQUENCE LENGTH : 90 amino acids 



STRANDEDNESS : single 



TOPOLOGY 



linear 



MOLECULAR TYPE ; HCV peptide 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Amino acids 1679 to 1768 of NS-4 region of HCV1--3 showing 
individual variations and consensus sequence. 
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SE QUENCE I.T.qTTNG fFl ^TPK 11a and llcl 



SEQ ID NO 



SEQUENCE 



TYPE : Nonameric peptide sequences 



SEQUENCE I^NGTH 



: 9 amino acids each 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : nonameric HCV peptides 



nPTGTNAL SOURCE 



ORGANISM 



; synthetic 



USE 



epitope mapping 



DEPOSIT 



FEATURES 



Nanapeptides corresponding to NS4 region of HCVl-3 used 
for epitope mapping. 
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SEQUENCE LISTING (FIGURE 13) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide cDNA sequences 



SEQUENCE LENGTH : 30, 70 and 32 base pairs 



STRANDEDNESS 



single 



TOPOLOGY 



linear 



MOLECULAR TYPE : genomic UNA from HCV types 1 to 4 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases -245 to -216; -185 to -116; and -101 to --70 in the 
5 'NCR region of HCV types 1-4. 
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SE QUENCE LISTING fFIGU RE ISA and X5B) 



SEQ ID NO : 10 

SEQUENCE TYPE : Nucleotide with derived protein sequence 

SEQUENCE UENGTH : 240 base pairs 

STRANDEDNESS : single 

TOPOLOGY s linear 

MOLECULAR TYPE : genomic UNA 



ORIGINAL SOURCE 



ORGANISM J human blood samples 



USE 



DEPOSIT = 
FEATURES 

Base pairs 23 - 262 and amino acids 5 to 89 of the core 
region of HCV types 1-4. 
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CLAIMS 

1. A polynucleotide sequence unique to hepatitis C virus 
type 3 or 4 (HCV-3 or HCV-4) . 

2. A polynucleotide sequence according to claim 1 which 
is a cDNA sequence. 

3. An HCV-3 or HCV-4 specific polynucleotide sequence 
from the core, NS3, NS4 or NS5 regions. 

4. An HCV-3 or HCV-4 specific polynucleotide sequence 
comprised within the sequences shown in Figures 1 and la, 
9a, 13 and 15a. 

5- An HCV-3 or HCV-4 polynucleotide sequence which codes 
for an antigenic peptide. 

6. An HCV-3 or HCV-4 polynucleotide sequence which codes 
for C-100, 5-1-1, 033 or C22 peptides having sequence 
alterations unique to HCV-3 or HCV-4- 

7. An HCV-3 polynucleotide sequence encoding the 
polypeptide 

KPALVPDKEVLYQQYDEM or the polypeptide 
ECSQAAPyiEQAQVIAHQF 
or polypeptides of substantially equivalent antigenicity- 
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8. A hepat-itus C vims type 2 (HCV*2) polynucleotide 
sequence encoding the polypeptide 

RCA/V)V(V/I) (A/T)PDKE(1/V)LYEAFDEM or the polypeptide 

EGAS (K/R) AALIEEGQR (M/I ) AEML 
or polypeptides of substantially equivalent antigenicity. 

9- An antigenic HCV-3 or HCV*4 specific peptide. 

10. An antigenic HCV-3 or HC7-4 specific peptide from the 
core, NS3, or NS5 regions. 

11- An antigenic HCV-3 specific peptide comprised within 
the sequences shown in Figures 3 ^ 5 or 7 - 

12* An antigenic HCV-3 or HCV-4 specific peptide which 
corresponds to c-lOO, 5-1-1, C33 or C22 proteins having 
sequence alterations unique to HCV-3 or HCV-'4. 

13. An antigenic NS4 peptide specific to HCV-3 or HCV-4. 

14* An antigenic HCV-2 specific peptide which is 
R{A/V)V(V/I) (A/T)PDKE(1/V)LYEAFDEM or 
EGAS (K/R) AALIEEGQR(M/I) AEML 

15. An antigenic HCV-3 specific peptide which is 
KPALVPDKEVLYQQYDEM or 
ECSQAAPYIEQAQVIAHQF 
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16. An antigenic NS4 peptide specific to which 
comprises the HCV-4 sequence substantially from position 
1691 to 1708 or the sequence substantially from position 
1710 to 1728. 

17. A fusion peptide which comprises at least two of the 
antigens of claims 9 to 16. 

18. A fusion peptide according to claim 17 which 
comprises at least one of the antigens of claims 9 to 16, 
fused to fi-galactosidase, GST, trp or polyhedrin coding 
sequence. 

19. A peptide according to any of claims 9 to 18 which is 
labelled. 

20* A vaccine formulation which comprises an antigen 
according to any of claims 9«18. 

21. Antibody to the antigenic peptide of any of claims 9 
to 19- 

22. An immunoassay device which comprises a solid 
substrate having attached thereto antigen according to any 
of claims 9 to 19. 

23. A device according to claim 22 for HCV screening 
which comprises a mixture of antigens. 
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24. A device according to claim 23 wherein the antigens 
are from the core and NS4 regions. 

25. A device according to claim 22 for HCV-typing which 
con^trises a plate having a series of locations 
respectively containing HCV-1, HCV-2, and HCV-3 specific 
antigens. 

26. A device according to claim 25 which further 
comprises a location containing HCV-4 specific antigen. 

27. An immunoassay device which comprises a solid 
substrate having attached thereto antibody according to 
claim 21. 

28. A method of in vitro testing for HCV which comprises 
reverse transcribing any HCV polynucleotide present and 
ajnplifying by polymerase chain reaction {PCR) , and 
detecting the amplified HCV polynucleotide employing an 
HCV-2, HCV-3 or HCV-4 specific polynucleotide probe. 

29. A method of in vitro testing for HCV according to 
claim 28 wherein the primers employed for PGR are selected 
from those set out in Table 1. 
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30. A method of in vitro HCV typing which comprises 

- carrying out endonuclease digestion of an HCV- 
containing sample employing scrPI or Haelll/Rsal 
endonuclease; and 

- comparing the restriction patterns with 
characteristic type-specific patterns. 

31. A method according to claim 30 or 31 which further 
employs Hinfl in a separate or combined digestion. 

32. A method of in vitro HCV typing which comprises 

- carrying out endonuclease digestion of an 
HCV-containing sample employing ScrFI endonuclease, 
the restriction pattern being characteristic of 
HCV-1, HCV-2 and HCV-3; 

- carrying out endonuclease digestion employing Hinfl 
endonuclease, the restriction pattern being 
characteristic of HCV-4. 

33. A method of in vitro screening a sample for HCV 
antibodies which comprises; 

- carrying out an immunoassay employing an antigenic 
peptide according to any of claims 1 to 19 or mixture 
thereof ; and 

- detecting any antibody - antigen complex produced. 
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34. A method of in vitro HCV antibody typing which 
comprises separately contacting a potentially 
antibody^containing sample or samples with a series of 
HCV-*type-specific antigens; and detecting any 
antibody-antigen complex produced with each type-specific 
antigen. 
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FIGURE 2 
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FIGURE 4 
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FIGURE 6 
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FIGURE 9a 



NDCLEOTIOE SBOtlBNCES 



4911 

T0040 GACACACCC? 
TO 03 a ??CACACCCC 
T0036 77CACACCCC 
T0026 ???7GACCCC 
TIVST ????CACCCC 



GDCACAAAAa 
AUCACAAAAU 
AUCGCGAAAO 
AUCACAAAAt? 



ACJU7CADGGC 
ACAaCAOGGC 
ACAOCADCGC 
ACCOCADGGC 
ACGCTCMGGC 



AaGCACGOCA 
ADGCAUCTCA 
AUGCAOOaCA 
AUGDADGUCA 
AUGCAUGnCA 



GCtJGAP CUGG 
GCUGAUCUGG 
GCUGAUCUGG 
GCUGAUCraGG 
GCOGAaCQGG 



4970 

AAGOAACCAC 
AAGUAACCAC 
AAGOAACCAC 
AAGQAACCAC 
AAGUAACCAC 



Cons GACACACCCU GOCACAAAAU ACAUCATOGC AUGCAUGUCA GCUGAUCaOG AAGOAACCAC 



4971 

T0040 CAGCACCOGG 
T0038 CAGCACCOGG 
TO 03 6 CAGCACCOGG 
T0026 CAGCACCOGG 
T1787 CAGCACCOGG 



GOQUOGCOOG 
GO UUUc ^UUG 
GO GUUGCUOG 
GOGDOGCOOG 
GOC OUG C O O G 



GAGGGGOCCO 
GAGQQ GOC CIT 
GAGGGGOCCO 
GAGGAGOCCO 
GAGGCmOCCO 



CGCGGCCCOA 
CGCGGCCCOA 
CGCGGCCCOA 
CGCOGCCCOA 
CGCGGCCCOA 



GCG6CCOACO 
GCGGCCOACO 
GCGGCCOACU 
GCGGCCOACO 
GCGGCCOACO 



5030 
w 

G CUUGUC AGO 
G LUUUUC AGO 
G uuut iUCAGO 
GCOUGOCAGO 
GCOOGOCAGtJ 



Cons CAGCACCOGG GOGOUGCOOG GAGGGGOCCO CGCGGCCCOA GCGGCCOACO GCOOGOCAGO 

B090 



5031 

T0040 CGGCOGCGOO 
T0038 CGGCOGCGOO 
T0036 CGGCOGCGOO 
T0026 CGGCOGCGOO 
T1787 CGGCOGCGOO 



GOGAOOGOCG 
OTS AOOGOG G 
GOGAOOGOGG 
GOG AUOGOQ Q 
GOGAOOGOGG 



GOCAOAOOGA GCUGGGGGGC AACCCGGCA AOC OOOC CAGA 
GCCAOAOOGA GCUGGGGGGC AAGCCCGCA COCGOUCCAGA 
GCCAOAUOGA GCUGGGGGGC AAGCCGGCA COCGOUCCAGA 
GOCAOAOOGA GCUGGGGGGC AAGCCAGCA COCGOOCCAGA 
GOCAOAOOGA GCOGGO^GGC AAGCCGGCA COCGOOCCAGA 



Cons CGGCOGCGOO GOGAOOGOGG GOCAOAOOGA GCUGGGGGGC AAGCCGGCA AOCGOOCCAGA 



5091 
w 

T0040 CAAAGAGGOG 
T0038 CAAAGAAGOG 
T0036 CAAAGAGGOG 
T0026 CAAAGAGGOG 
T1787 CAAGGACGOG 



OOGOAUCAAC 
OOGOAOCAAC 
OOGOAUCAAC 
OOGOAUCAAC 
UUGUAOCAAC 



AAOACGAOGA 
AAOACGAOGA 
AAUACGAUGA 
AAOACGAOGA 
AAUACGAUGA 



GAOGGAGGAG 
GAOGGAGGAG 
GAOGGAGGAG 
GAOGGAGGAG 
GAOGGAGGAG 



UGCUCGCAA 
UGGUCGCAA 
UGCUCGCAA 
UGCUCGCAA 
UGCUCGCAA 



5150 
▼ 

GCUGCCCCAOA 
GCCGCCCCAUA 
GCCGCCCCAUA 
GCCGCCCCAUA 
GCCGCCCCAUA 



Cons CAAAGAGGOG UUGUAOCAAC AAUACGAUGA GAOGGAGGAG UGCUCGCAA GCUGCCCCAOA 

5210 
w 



5151 

TO 040 UAUCGAACAA 
T0038 UAUCGAACAA 
TO 03 6 UAUCGAACAA 
T0026 UAUCGAACAA 
T1787 UAUCGAACAA 



GCUCAGGOGA 
GCOCAAGOAA 
GCOCAGGUAA 
GCUCAGGOAA 
GCOCAGGOAA 



UAGCCCACCA 
UAGCGCAGCA 
UAGCCCACCA 
UAGCCCACCA 
UAGCCCACCA 



GUUCAAGOAG 
GUUCAAGGAG 
GUUCAAGGAG 
GUUCAAGGAG 
GUUCAAGGAG 



AAAGUCCUU 
AAAGUCCUU 
AAAGUC CUU 
AAAGUCCUU 
AAAGUCCUU 



GGAUUGCOGCA 
GGAUUGCUGCA 
GGAUUGCOGCA 
GG AUUG CUGCA 
GGOOOGCUGCA 



Cons UAUCGAACAA GCUCAGGOGA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGAUUGCUGCA 



5211 

T0040 GCGAGCCaCC 
T003a GCGGGCCACC 
T0036 GCGAGCCACC 
T0026 GCGAGCCACC 
T1787 GCGAGCCACC 



CAACAACAAG 
CAACAACAAG 
CAACAACAAG 
CAACAACAAG 
CAACAACAGG 



CU GUOA UUGA 
CU GUUA UUGA 
CUGDUAUUGA 
CUGOCAOUGA 
CUGUCAUUGA 



GCCAAUAGUA 
GCCCAUAGUA 
GCCAAUAGUA 
GCCCAUAGUA 
GCCCAUAGUA 



GCUACCAAC 
GCUACCAAC 
GCUACCAAC 
GCUACCAAC 
GCUACCAAC 



5270 

UGGCAAAAGCU 
UGGCAAAAGCU 
UGGCAAAA77? 
UGGCAAAA??? 
UGGCAAAAGCU 



Cons GCGAGCCACC CAACAACAAG CUGUUAUUGA GCCCAUAGUA GCUACCAAC UGGCAAAAGCU 
5271 

T0040 UGAGACC 
T0038 UGAGGCC 
T0036 ??7??7? 
T0026 ??????? 
T1787 OGAGGCtT 



Cons UGAGACC 
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PIGUBE 9b 



T0040 
T0038 
T0036 
T0026 
T17S7 



1638 

THFVT KYIMA 
.HPVT KYIMA 



CMSAD LEVTT 
CMSAD LEVTT 
CMSAD LEVTT 
CMSM) LEVTT 
CMSAD LHVTT 



STWVL LGGVL AALAA 
STWVL LGGVL AALAA 
STWVL LGGVL AALAA 
STWVL LGGVL AALAA 
STWVL LGGVL AALAA 



YCLSV GCWI 
YCLSV GCWI 
YCLSV GCWI 
YCLSV GCWI 
YCLSV GCWI 



1787 

VGHIE 
VGHIE 
VGHIE 
VGHIE 
VGHIE 



Cons THPVT KYIMA CMSAD LEVTT STWVL LGGVL AALAA YCLSV GCWI VGHIE 



1688 
▼ 

T0040 LGGKP 
T0038 LGC^ 
T0036 LGGKP 
T0026 LGGKP 
T1787 LGGKP 



AIVPD KEVLY QQYDE 
ALVPD KEVLY QQYDE 
ALVPD KEVLY QQYDE 
ALVPD KEVLY QQYDE 
ALVPD KEVLY QQYDE 



MEECS QAAPY 
MEECS QAAPY 
MEECS QAAPY 
MEECS QAAPY 
MEECS QAAPY 



lEQAQ VIAHQ 
lEQAQ VIAHQ 
lEQAQ VIAHQ 
lEQAQ VIAHQ 
lEQAQ VIAHQ 



1737 
▼ 

FKEKV LGLLQ 
F^CV LGLLQ 
FKEKV LGLLQ 
itKEKV LGLLQ 
FKEKV LGLLQ 



Cms LGGKP ALVPD KEVLY QQYDE MEECS QAAPY lEQAQ VIAHQ FKEKV LGLLQ 
1738 



1765 

T0040 BATQQ QAVIE PIVAT NWQKL ETFWH KHM 
T0038 RATQQ QAVIE PIVAT NWQKL EAFWH KHM 

T0036 RATQQ QAVIE PIVAT NWQ 

T0026 RATQQ QAVIE PIVAT NWQ 

T1787 RATCXJ OAVIE PIVAT NWQKL EAFWH KHM 



Cons RATQQ QAVIE PIVAT NWQKL EAFWH KHM 
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